A found a few shells of a species that has ~10 observations on iNat and around 30 photos. Species tend to get added to CV and the geomodel at around 100 photos. Would it be a good idea to take a lot of photos (more than 10) with a lot of different backgrounds and angles to add to a single observation, with the intent to speed up the process of getting it into the CV? Are there any downsides to this?
Also, here’s an example of a few different photos I’ve taken of the same few shells. I haven’t uploaded them yet.
Apparently only up to 5 photos from an observation might be selected for training (could be fewer than 5 if randomized), so you’d need to observe many different individuals of the species.
If your photos represent most of the observations it’s trained on, it would then be optimized to your camera settings and photo choices rather than those of a bunch of different people. So that could be a downside compared to a more “natural” progression of observations being added to iNat.
I did this for 4 small marine bivalves: the Lunate crassinella, three-tooth carditid, flat cardita, and grey pygmy venus. I took 4 photos each of about 14 shells from each species (front & back on two different backgrounds) which is about 50 new pictures per species. The first two are now included in the CV model!
for example https://www.inaturalist.org/observations/168110734 (grey pygmy venus) https://www.inaturalist.org/observations/168123150 (flat carditid)
I’m hoping to get Asphondylia lacinariae (Liatris gall midge) into the CV set. I took pictures of multiple individual galls from multiple angles. There is a lot of variability in the shape. But then I realized that all the ones I photographed are on the same host plant, Liatris pycnostachya var. lasiophylla, which would effect the training. This year I’m going to try to find some on a different species of Liatris.
I tried to vary things like how zoomed in the picture is, the color and texture of the background (I was biased towards sand background and hand shots as that’s how the majority of seashells are posted) and I might have also messed with brightness and saturation but I don’t remember.
At the current tally, I have 100 out of 116 observations. At the time I did them it pushed it above the threshold for inclusion. Five years later, if it were not for them, it would still not be close to inclusion. I’d say it’s a very good thing to do.