I donât know if the CV has better clues and considers the inaccuracy rate of its suggestion in future. I think that it should be weighted too, instead of just training for the same loss on a new set (or entire set); it would be better to value as weighted loss as this is something the previous data failed to consider; maybe the new observation set identified something new in a rare species or such of the cases discussed above where there is not enough diversity. It would be a tight balance between code complexity and trusting new data being IDâed without hampering the accuracy of previous model suggestions. I donât know if the code for all this is on github!? Can someone confirm and direct me? Thanks.
another metric is: the misidentification rate for that species with others (we see on taxa page), i think that should be considered too when giving final decision.
I see a mismatch between the Geo-model and CV model often. I saw CV showing completely different continent species confidently and new users directly agreeing to it. Ofc I am not saying we should ignore cases of migrations or such in CV, but when there is never an observation of a species in that area or country, there could be a marker that can be shown below the CV suggestion (for example, it shows expected nearby in mobile apps as of now, maybe it could be added there ânot seen before in xyzâ too, where xyz can be area or country or continent)
Especially with the lack of explainable decisions in current CV ( I really hope we can get some SHAP or such of CV decision for power user), there should at least be a transparent confidence indicator of its suggestion even if they are sorted on such confidence (maybe enabled via profile setting to overwhelm always?), on mobiles I see green âVisually similar/expected nearbyâ for probably better suggestions. Still, there is no such clue on the desktop; maybe a class of colour intensity indicator to distinguish titles of species in CV suggestion can be better on the desktop.
Also on desktop, the AI suggestions are triggered only for first image (on mobile one can slide to new image and the new CV suggestions appear for that image), sometimes the first image can be zoomed out shot or bad and ability to retrigger on desktop would be very helpful too.
When CV is making a suggestion, maybe we can include the species that are not in the CV right below the suggestions like the above example of algae; I feel it is the responsibility of CV to do it to reduce mis-IDs, which will only flywheel if not controlled (where the new CV version learning from such mis-IDs) now.
For example, this woodpecker has very few observations and is not included in CV, https://www.inaturalist.org/taxa/17964-Dendrocopos-assimilis. Still, for that range of woodpeckers, it would be really helpful if this suggestion popped below CV suggestions (something like species not added but probable woodpeckers) just to make users consider them too when using the CV tool, ofc it is totally hard when there is no ID or completely different things can look similar (but again maybe restriction in suggestions case as next point), but at least if it is a Red bug, we can show an indicator showing there are lot of redbugs possible in that area and not in CV, similarly as with above woodpecker when CV has already zoned confidently on woodpeckers suggestion, this non-included taxon is better shown right alongside CV recs.
Finally, for a new user (their global count or for that taxa?), it would be better to show higher-level restricted suggestions by default (maybe they can expand with another press or option), more so when the CV is not confident or if that species has a higher chance of past mis-IDs.