I’d like to do some computer vision development using my iNaturalist observations and have a request to make this easier. Ultimately, the specific problem is that I’d like to be able to link iNaturalist observations to the raw photos that I have on my computer. The only way I can imagine doing this retrospectively (and relatively efficiently for my 6000+ observations) is by scanning through the photo metadata and matching the timestamps and dimensions. However, the only method I’ve found to do this is by using the API to retrieve observations, then individually scraping the html from each photo page using a logged-in browser profile. This makes me a bit nervous because I’m accessing the site programmatically so frequently without the API - will that get me blocked if I let it run for all my photos??
In theory this should be a one-time use to link the photos with their respective observations, since in the future I can imagine a workflow where I post the observations using the API and store the observation IDs as I upload photos. But for now…
Ok, I read that page but I wasn’t sure if the requests-per-minute recommendations applied to general page access the same as direct API requests. Also, I suppose I did have a number of questions mixed in and could write a more isolated request, but my intent was to request that the API provide direct access to the photo metadata. Thanks for the reply!
The only way I can imagine doing this retrospectively (and relatively efficiently for my 6000+ observations) is by scanning through the photo metadata and matching the timestamps and dimensions. However, the only method I’ve found to do this is by using the API to retrieve observations, then individually scraping the html from each photo page using a logged-in browser profile
Good point; thanks! I was starting with the timestamps since the dimensions are mostly redundant, but I had forgotten that that part was in the observation json.