There are guides of various types in the Guides link (found under More on the header). They are not comprehensive but they are helpful.
The response to posts on iNat can be variable. I actually joined iNaturalist because I decided to make myself a Covid project to learn about bumblebees. I posted a few species and figured at least one of them would get a response. In the meantime I started posting other photographs (old and new) and rediscovered interests that had been dormant for years. No regrets on the bees, which alas remain unidentified. I'm sure somebody will weigh in eventually.
This forum is also a nice bonus. Some wonderful folks and some interesting stuff.
Thanks! Yup, that's exactly it. There are a zillion wiki implementations for data repositories, and I would love it if iNat implemented its own from which to pull useful taxonomic information in place of the Wikipedia blurb. Thanks for finding those discussion pages for me! I tried a couple of different searches and wasn't finding the right things.
When the CV is "pretty sure", the odds of being wrong are lower; when it is not sure, it will probably learn the taxon in the next training iteration. I think that is a good balance, otherwise it becomes the problem the OP refers to. In my experience of almost 50 observations where I tried the CV prompt and there was no "we're pretty sure" suggestion, the other suggestions were simply wrong.
With the published books, there's some incentive (fame and fortune!) for a person, or persons, to spend years putting together those descriptions. And the descriptions in those books are usually only useful regionally. We can't copy copyrighted content from books and put it on iNat. The content that you do find on iNat comes from the users of iNat (like yourself) and others who contribute to Wikipedia articles. There's no staff for adding content to iNat itself. Hope that helps! It would be nice to have a giant Swiss army knife, but the more tools added to the knife, the more unwieldy it becomes, even if it's possible to add the tools and financially feasible to build and maintain it.
For training the model, if you have more than 1000 photos for a species/taxon, are only photos from RG observations used?
And a different question, is there any way for an iNat user to know which species are on the bubble, e.g. close to having 100 photos and thus able to be used to train the model?
If there was a way for me to know which species were close and occur in my area, I could try to take more photos of those species so they could be included for the new model.
They're chosen pretty randomly; RG/not RG is not a factor.
Maybe using the API? Either way, I personally don't think training the computer vision model should be a priority for users. Have fun exploring and observing what you want to observe. But to each their own. :-)
For me, this is one of the primary incentives to use iNaturalist:
FILLING IN THE BLANKS
Because it's like a kind of puzzle with missing pieces… the CV can see some genera but has no data about others, so the blanks need filling in. If I find a local species it doesn't recognise to genus or even family, I have been actively aiming to achieve 50 observations of it to try to get it recognised.
FUTURE FOCUSSED
It feels like a long-term goal. Unlike helping to identify organisms on other sites, where identifications might languish or never be entered into a dataset, here it feels like we're all chipping away at something bigger which (if it ever became accurate enough) could have serious impact down the line… in opening up and supporting society's awareness and ability to perceive the natural world, and in turn the larger ecological implications.
HELPING FIX THE BROKEN BITS
Similarly to the first point, incorrect CV suggestions that propagate feedback loops of misidentification feel like holes that need fixing, and something we can actively contribute to as users through correct identification.
This response might be slightly misleading, unless I'm misunderstanding the other posts around this. RG is not a factor in training, but it is in testing. (Many might not know how ML works, so the word "training" might encompass testing for some readers.) That might sound pedantic! But for me, as mentioned, it's a core incentive, so I was happy to learn more about how it works this week (and I will be happy if someone corrects this with further info).
My current understanding:
HELPING FIX MISSING SPECIES with fewer than 100 photos
If there are fewer than 100 photos of a species… then, like @matthias55, we can try to help train it.
We do not need to reach RG, just accumulate the 100. This should be roughly visible through exploring observations, though, so no need to use the API(?)…
e.g. a blank I think I've nearly filled: https://www.inaturalist.org/observations?place_id=any&subview=table&taxon_id=451684
30 observations with 1-6 photos each should be approaching the necessary 100 total.
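As a rough sketch of the API route mentioned earlier, the tally could be automated against the public iNaturalist v1 `observations` endpoint (each result carries a `photos` array). The helper names here are mine, and a real tally would need to page through all results rather than fetching a single page:

```python
import json
from urllib.request import urlopen

API = "https://api.inaturalist.org/v1/observations"

def count_photos(observations):
    """Sum photo counts across observation records shaped like
    the iNat API's JSON results (each may have a 'photos' list)."""
    return sum(len(obs.get("photos", [])) for obs in observations)

def fetch_photo_total(taxon_id, per_page=200):
    """Fetch one page of observations for a taxon and tally photos.
    Caveat: only covers the first page; paginate for a full count."""
    url = f"{API}?taxon_id={taxon_id}&per_page={per_page}"
    with urlopen(url) as resp:
        data = json.load(resp)
    return count_photos(data["results"])

# Example (requires network), using the taxon from the link above:
# print(fetch_photo_total(451684))
```

Comparing that total against the 100-photo threshold would show how close a "blank" species is to being trainable.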
Currently, the CV suggests some sort of ant for this species, so the wrong taxonomic order entirely… a nice sense of achievement to fix, I think!
HELPING FIX INCORRECT SPECIES with over 1000 photos
If, however, over 1000 photos already exist and a user instead wishes to help fix a recurrent error in the CV, adding more photos won't necessarily help… this is more about ensuring cleanliness of the existing test dataset. In that case, helping with quality control as an identifier might contribute to resolving the issue more directly and prevent more incorrect observations being placed in the dataset.
A core problem at present, as visible in the computer vision clean-up wiki, seems to be errors created by this feedback loop: misidentification >> wrong auto-suggest >> further misidentification.
HELPING FIX INCORRECT SPECIES with 100-1000 photos
If there are between 100 and 1000 photos for a species, helping with identification quality control and increasing the amount of training data both seem like valid ways to help. Both should contribute to overall accuracy, if I understand correctly.