Where does the iNat-GBIF taxonomy cross-reference live?

I’d like to be able to match GBIF taxon identifiers with their respective iNaturalist taxon identifiers (example GBIF:5141342 = iNaturalist:49972). iNaturalist periodically dumps observations to GBIF, so somewhere there must exist a database or something to translate between taxon IDs. I understand the taxonomy backbones probably aren’t one-to-one (for example iNat may include a taxon that GBIF doesn’t) and that’s ok. Does anyone know a straightforward way to go between GBIF and iNat taxon IDs?

iNaturalist taxon pages do list other external taxon identifiers (example) including GBIF, but iNat’s get_taxa_id API endpoint doesn’t appear to return the GBIF ID.

Some possible solutions are to go through WikiData (as suggested by @pisum or maybe the ChecklistBank API, but I’d love to find a more direct go-between and ideally search on the GBIF taxon ID and get the corresponding iNat taxon ID.

Thanks for any help,

Will

1 Like

https://www.wikidata.org/wiki/Q1758032 a already given answer…

Welcome to the forum!
iNat doesn’t keep track of GBIF taxon identifiers for the purposes of data export. Instead it sends over the scientific name and GBIF uses its name matching tool.

Some taxa know their GBIF id through a taxon scheme, but this is not available in the API, see here.


Do you mean this?

If so, that is coming directly from Wikipedia, as you can see by the citation at the bottom. You can access that from Wikidata.

4 Likes

Assuming you have the species (scientific) name associated with the GBIF id, I think you can use the get_taxa endpoint q parameter to submit a query for the name and get the iNat id. You may not get anything in some cases though and you will have to sort through the results to get the corresponding name if there is one.

At least historically the GBIF ‘Taxon IDs’ were non-persistent and were identifiers for instances of name-strings, not taxon concepts. So, any linkages using these identifiers would have transient validity. That may have changed.

https://www.checklistbank.org offers a name based mapping between any of the datasets it contains.
iNaturalist is included via the monthly DwCA dumps with datasetKey 139831.

You can retrieve a taxon via the API by it’s original identifier, e.g. iNats Battus philenor is retrieved via https://api.checklistbank.org/dataset/139831/nameusage/https%3A%2F%2Fwww.inaturalist.org%2Ftaxa%2F49972

Personally I would have preferred just 49972 as the identifier, but iNat has decided to publish the DwC archive with the website URLs instead. Maybe that should be reviewed?

You can then list related names from any of the other datasets by using this API call (53147 being the GBIF Backbone datasetKey):

https://api.checklistbank.org/dataset/139831/nameusage/https%3A%2F%2Fwww.inaturalist.org%2Ftaxa%2F49972/related?datasetKey=53147

or vice versa from GBIF to iNat:
https://api.checklistbank.org/dataset/53147/nameusage/5141342/related?datasetKey=139831

Other popular dataset keys to play with:

  • COL Latest Release: 3LR
  • COL AC22: 9837
  • ITIS: 2144
  • WoRMS: 2011
  • GBIF: 115450
  • iNaturalist: 139831

The crossmapping is based on names, not taxa. We have also planned to work on a taxon concept mapping more like Avibase does, but that is not on our immediate priority list right now.

3 Likes

Thanks @jwidness for the tip regarding Wikidata. For anyone who needs it, here’s a Wikidata SPARQL query that takes a GBIF taxon id (“5141342” in this example) and returns the corresponding (1) iNaturalist taxon id and (2) ITIS TSN, if known.

Query form:

SELECT ?iNat_Taxon_ID ?ITIS_TSN WHERE {
  ?item wdt:P846 "5141342". # GBIF Taxon ID
  OPTIONAL { ?item wdt:P3151 ?iNat_Taxon_ID. } # iNaturalist taxon ID
  OPTIONAL { ?item wdt:P815 ?ITIS_TSN. } # ITIS TSN
  }

Endpoint URL:
https://query.wikidata.org/sparql?query=SELECT%20%3FiNat_Taxon_ID%20%3FITIS_TSN%20WHERE%20{ %20%20%3Fitem%20wdt%3AP846%20"5141342". %20%20OPTIONAL%20{%20%3Fitem%20wdt%3AP3151%20%3FiNat_Taxon_ID.%20} %20%20OPTIONAL%20{%20%3Fitem%20wdt%3AP815%20%3FITIS_TSN.%20} %20%20}

Thanks for that helpful context @cooperj !

Thanks @markus ! I hadn’t heard about checklistbank before this but it sounds promising. The example URLs are much appreciated!

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.