Computer Vision should take into account fraction identified to species

This sounds related to this common issue (my comment on New Computer Vision Model (v2.17) with over 1,000 new species!):

I think there are probably ways the CV could be optimized to reduce this, but doing it well would be pretty complicated and take a lot of problem-solving.

The main question, I think, is how you make it aware that other similar species exist if there aren’t enough observations of them to be added to the training pool. If it isn’t aware of a species, it doesn’t know whether that species looks identical to the ones it knows, exists but is very distinct, or exists at all.
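
For what it's worth, here is a rough sketch of what the idea in the thread title might look like. It is only an illustration, not how the actual model works: the function name, the counts, the multiplication, and the 0.5 threshold are all made up for the example. The idea is that the fraction of a genus's observations that identifiers have taken to species acts as a proxy for how distinguishable its species are, including the ones the CV has never seen.

```python
def suggest_rank(species_score: float,
                 n_species_level: int,
                 n_genus_total: int,
                 threshold: float = 0.5) -> str:
    """Decide whether to suggest a species or fall back to the genus.

    All names and the threshold are hypothetical, chosen for illustration.
    species_score:   the model's confidence in its top species (0 to 1)
    n_species_level: observations in the genus identified to species
    n_genus_total:   all observations in the genus
    """
    if n_genus_total <= 0:
        # No data about the genus, so stay conservative.
        return "genus"

    # Fraction of the genus that identifiers were able to take to species.
    fraction_to_species = n_species_level / n_genus_total

    # Assumption: scale the score down when most observations stay at genus.
    adjusted = species_score * fraction_to_species
    return "species" if adjusted >= threshold else "genus"
```

With these made-up numbers, a 0.9 score in a genus where 90 of 100 observations reach species would still get a species suggestion, while the same score in a genus where only 20 of 100 do would fall back to genus. It doesn't solve the problem of telling "identical lookalikes" apart from "distinct but unobserved" species, but it gives the CV an indirect signal that they might exist.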
