iNaturalist Licensed Observation Images in the Amazon Open Data Sponsorship Program

Yes, this was intentional. We last generated the export on the 6th, but during a final review I noticed that when we added usernames, I had failed to trim whitespace from them. That let some newlines into the file, which caused import commands to fail. So I just removed the newlines from the observers file; otherwise, the data in all files was generated on the 6th.
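For anyone who runs into the same issue in their own pipeline, here is a minimal sketch of the kind of cleanup described above. It is not the code we actually run; the function name `clean_login` is hypothetical, and it assumes the username is available as a plain string before being written out.

```python
def clean_login(raw: str) -> str:
    """Return a username that is safe to write as a single field on one line.

    Hypothetical helper for illustration only.
    """
    # splitlines() breaks on \n, \r\n, \r and other line boundaries,
    # so joining the pieces removes any embedded line breaks.
    without_breaks = "".join(raw.splitlines())
    # strip() then removes leading and trailing spaces or tabs.
    return without_breaks.strip()


if __name__ == "__main__":
    for sample in ["alice\n", "  bob \r\n", "car\nol"]:
        print(repr(clean_login(sample)))
```

Cleaning the value before it is written keeps each record on one line, which is what line-oriented import commands generally expect.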

since metadata files will be generated on a monthly schedule, is the 15th going to be the date each month when they will be generated, or will it be some other date?

We haven’t set an exact date yet. I’m still debating whether it’s better to generate the files on a fixed date (say the 1st or 15th of each month) or on a fixed day (say the first Saturday of each month). The former is easier to explain and maybe better for user expectations; the latter is easier for us to plan other background jobs around. Does anyone have a preference?
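To make the two options concrete, here is a small sketch that computes both candidate dates for a given month. The function names are hypothetical and this is only an illustration of the trade-off, not our scheduling code.

```python
from datetime import date, timedelta

SATURDAY = 5  # date.weekday() numbers Monday as 0 and Sunday as 6


def fixed_date_run(year: int, month: int, day: int = 15) -> date:
    """Option 1: the same calendar day every month."""
    return date(year, month, day)


def first_weekday_run(year: int, month: int, weekday: int = SATURDAY) -> date:
    """Option 2: the first occurrence of a given weekday in the month."""
    first = date(year, month, 1)
    # Days to move forward from the 1st to reach the target weekday (0 to 6).
    offset = (weekday - first.weekday()) % 7
    return first + timedelta(days=offset)


if __name__ == "__main__":
    for month in (5, 6):
        print(fixed_date_run(2021, month), first_weekday_run(2021, month))
```

With the fixed-day option the run lands anywhere from the 1st to the 7th, so consecutive exports end up either four or five weeks apart.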

have you all given any thought to either adding observation id to the observation metadata file or adding observation UUID to the iNaturalist CSV export?

Ideally for us, we’d start using observation UUIDs everywhere right now and avoid auto-incrementing integer IDs entirely. Since this isn’t practical or possible to do all at once (we will be making this switch over time), of the two options I’d prefer to include observation UUIDs in the CSV exports. UUIDs are currently available via the API. Auto-increment integer IDs are a problem in that they have an implied meaning (sequence) that UUIDs do not. For various reasons we only want identifiers to identify resources and not have any additional implied meaning, which is why we have chosen to leave them out of this export and do not plan on adding them.

As an aside, we did leave photo IDs in the export, since we currently store photos by ID and the ID is therefore needed to construct their URLs. Again, in an ideal world we’d switch to UUIDs for photos as well, but that would take a lot more work than it does for observations. This is something we may do in the future.
