Psst - New Vision Model Released!

Not training!! But suggestions, analysing… after training, on the observation. Currently I thought it was using the first photo of an observation for the suggestions and skipping all other photos.
I could not find it, but it seemed I had missed several posts on CV/AI training.

Are there more tips on the way the CV/AI works? (Cropping can definitely improve results.)

===
https://www.inaturalist.org/pages/help#cv-taxa
https://www.inaturalist.org/pages/help#computer-vision
https://www.inaturalist.org/pages/help#cv-select
https://www.inaturalist.org/blog/31806-a-new-vision-model
FWIW, there’s also discussion and some additional charts at
https://forum.inaturalist.org/t/psst-new-vision-model-released/10854/11
https://www.inaturalist.org/pages/identification_quality_experiment
https://www.inaturalist.org/journal/loarie/10016-identification-quality-experiment-update
https://www.inaturalist.org/journal/loarie/9260-identification-quality-experiment-update
about a rare species, but the system might still recommend one based on nearby observations
https://forum.inaturalist.org/t/identification-quality-on-inaturalist/7507
https://github.com/kueda/inaturalist-identification-quality-experiment/blob/master/identification-quality-experiment.ipynb
“nearby” means near in space and time
The model improved on sedges and grasses
the vision model does not itself incorporate non-image data other than taxon IDs
https://www.inaturalist.org/blog/25510-vision-model-updates (“taxon and region comparisons” 20190614)
https://distill.pub/2020/circuits/zoom-in/ (“connections between neurons”)
https://www.inaturalist.org/projects/flora-of-russia/journal/31726
https://forum.inaturalist.org/t/provide-relevant-geographic-data-confidence-level-accuracy-scores-with-ai-suggestions/9226/2
https://forum.inaturalist.org/t/range-covered-by-the-seen-nearby-feature/2849/5


https://www.inaturalist.org/computer_vision_demo
http://www.vision.caltech.edu/publications/publications.html
http://www.vision.caltech.edu/archive.html
https://vision.cornell.edu/se3/
https://vision.cornell.edu/se3/publications/
https://merlin.allaboutbirds.org/
https://sites.google.com/visipedia.org/index/publications

https://forum.inaturalist.org/t/use-computer-vision-to-annotate-observations/3331
https://forum.inaturalist.org/t/what-image-s-are-used-for-training-computer-vision/3307/6

4 Likes

Sorry, my previous suggestion probably went to the wrong place (it's about improving the AI suggestions).
One idea that might be more appropriate here is to allow a smaller number of photos for selected taxa in the AI training. The reasoning is that some taxa are much more distinctive than others, and this might allow some of the rarer taxa to get in. Just an idea - I don't know how much trouble that would be, or if it can even work in your process.

3 Likes

There is a thread on a related topic, using the CV to populate annotations:
https://forum.inaturalist.org/t/use-computer-vision-to-annotate-observations/3331

3 Likes

Hi! I had a couple questions related to the new computer vision model and thought I’d float them here (I’m happy to relocate if there’s a better place).

First, is there any particular reasoning behind only running CV (edit: CV prediction) on the first image in a set? I've started polling the CV on each photo in the uploader prior to merging, but it'd be nice to be able to access this info after the fact to guide identification of my own and others' photos.

Second, I’m revisiting my old non-research-grade observations in light of the new CV, and ran into a quandary. If a species I couldn’t previously identify now shows up with a reasonably strong CV suggestion, I’m tempted to add it. If a user previously suggested that species, this would promote the observation to research grade. I think this matches intent: the two identifications should be independent, because sub-RG observations don’t feed into the model (except those that are sub-RG only because they’re marked cultivated). Still, it feels a bit weird.

2 Likes

On the first question: the CV is trained on all observation photos, but when you get a suggestion on an existing observation it only looks at the first photo. In the uploader, before you merge a group of photos into one card, you can ask for CV suggestions on each card independently, so that’s a way to see whether any of the photos might offer something different.

4 Likes

Suggestions welcome

Figure out a way to train with the location and date

People keep bringing up modifying the input images, but you could also just provide location and date as secondary inputs to the model architecture. For example, if the architecture is currently:

  • [image] → [convolutional layers] → [output layer]

You could make it:

  • [image] → [convolutional layers] → [concatenated layer] → [output layer]
  • [location] → [concatenated layer]

And location could simply be represented by the real value of the latitude and something like the cosine of the longitude after scaling from -pi to pi. (This enables it to wrap around.) You could do similar things to enable dates to wrap, giving a seasonality input value.

edit to add: The wrapping method I gave makes it so that 90W is exactly as far from 0 as 90E, but doesn’t capture that those are opposite from one another. If you use both the sine and the cosine you get both pieces of information. Intuitively, the sine and cosine denote a specific angle that points to a specific location on a circle (where the circle is a latitude line). Likewise, the sine of a date is about 0 at solstice and the cosine is about 0 at equinox, and the pair of them together tells you exactly where in the annual cycle you are.
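For concreteness, here’s a minimal sketch of that encoding in Python. The function name, the latitude scaling, and the 365.25-day year length are my own choices for illustration, not anything from iNaturalist’s actual code:

```python
import math

def location_date_features(lat_deg, lon_deg, day_of_year):
    """Hypothetical encoder: latitude stays a plain scaled value,
    while longitude and date become (sin, cos) pairs so they wrap."""
    lon = math.radians(lon_deg)                      # degrees -> [-pi, pi]
    year_angle = 2 * math.pi * day_of_year / 365.25  # date -> angle on the annual cycle
    return [
        lat_deg / 90.0,                              # latitude, scaled to [-1, 1]
        math.sin(lon), math.cos(lon),                # longitude as a point on a circle
        math.sin(year_angle), math.cos(year_angle),  # date as a point on a circle
    ]

# Two points on opposite sides of the date line come out as near-identical
# feature vectors, which is exactly the wrap-around behavior we want:
print(location_date_features(52.0, 179.9, 172))
print(location_date_features(52.0, -179.9, 172))
```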

I don’t know how useful this is / don’t have citations for its use, but it should be easy enough to try and test on a small dataset to see what happens.
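If anyone wants to play with it, here’s a rough PyTorch sketch of the concatenation idea, with a toy convolutional trunk and made-up layer sizes standing in for a real vision model (none of this reflects iNaturalist’s actual architecture):

```python
import torch
import torch.nn as nn

class ImagePlusLocationNet(nn.Module):
    """[image] -> conv layers -> concat with [location/date] -> output."""
    def __init__(self, num_taxa: int, num_aux_features: int = 5):
        super().__init__()
        # Toy stand-in for the convolutional trunk of a real vision model.
        self.conv = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
            nn.Flatten(),
        )
        # Image features and the auxiliary inputs meet in one concatenated layer.
        self.head = nn.Linear(16 + num_aux_features, num_taxa)

    def forward(self, image, aux):
        x = torch.cat([self.conv(image), aux], dim=1)  # the "concatenated layer"
        return self.head(x)

# Usage: a batch of 2 images plus 5 location/date features each
# (e.g. the output of location_date_features above).
model = ImagePlusLocationNet(num_taxa=1000)
logits = model(torch.randn(2, 3, 224, 224), torch.randn(2, 5))
print(logits.shape)  # torch.Size([2, 1000])
```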

4 Likes

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.