As part of one of my PhD chapters, I recently (12 February - 3 March) organised and ran a three-week IDathon on iNaturalist for observations of plants in three biodiversity hotspots in Western Australia (WA). I recruited ~60 ‘experts’ with knowledge of WA plants: lots of professional taxonomists and botanists, but also amateur experts and herbarium volunteers, covering both pre-existing iNat users and new recruits. During the event we reviewed almost 12,000 observations (with another 5,000+ from the three focus areas already identified by at least one of these experts before the event began). It will be a few months before I get the chance to crunch all of the data, but I thought it’d be interesting to look at a small case study now.
Botanist + taxonomist Brendan Lepschi is an expert in Melaleuca, and he made 322 identifications during the event, so I assessed each observation he IDed.
As brief background, Melaleuca is a genus of trees and shrubs in the family Myrtaceae, and is one of Australia’s most diverse genera, with ~250 described species. Western Australia is the centre of diversity for the genus, with ~200 species known from the state (almost all native, plus a small handful of naturalised east coast species). Diversity is also generally very high even at smaller spatial scales, with often 40-60 (or even more) species found in a single national park. Broadly, it is also a genus for which identification is difficult, especially from photographs alone, and indeed for some species groups, identification is hard even with a specimen in hand if it lacks fertile material. There are certainly plenty of species in the genus that are easily recognised and identifiable to species from photographs, but there are also a great many that are either very difficult to identify from photographs, requiring expert knowledge of the group (the few keys that exist are very daunting to use given the huge number of species and couplets, and fertile material is really important for many species IDs), or indeed are impossible to ID from photographs alone. Many of these latter examples fall into a few species groups/complexes where identification is notoriously difficult, and indeed for these taxa, numerous herbarium vouchers are also misidentified or now carry misapplied names due to delays in re-detting after taxonomic revisions.
So overall, Melaleuca is on the tough end of the spectrum when it comes to identification, and it would be a reasonable assumption that identification accuracy on iNaturalist would not be particularly high, especially for a region like WA with very high diversity (and relatively few identifiers compared to the eastern states).
(As a quick aside here, on iNat, Melaleuca is Melaleuca sensu lato. A number of segregate genera [Beaufortia, Calothamnus, Regelia, etc.] were all transferred into Melaleuca about ten years ago. The Western Australian Herbarium and the Australian Plant Census still treat these genera as valid, but POWO and iNat lump them into Melaleuca. Brendan’s expertise is in Melaleuca sensu stricto, and that’s what almost all of his 322 IDs were of, aside from a very small handful of exceptions.)
So now to the stats. First of all, of the 322 observations Brendan IDed, 4 of them were not actually Melaleuca, meaning 99% of the observations were correctly identified at the genus level. There are other Myrtaceae genera that can be easily confused with Melaleuca, e.g., Kunzea, so this is a nice statistic even though it may not seem especially impressive at face value.
Of the 322 observations, 220 had been identified to species before Brendan’s IDs. He confirmed 175 of these as correct, i.e., 80% of observations identified to species were confirmed as correct. As for the other 45 (20%), around half were corrected from one species to another, and the other half were pushed back to genus because a species ID wasn’t possible from photographs in general, or from the particular photographs provided.
Of the 102 observations initially identified only as Melaleuca, 4 were corrected to a different genus, 15 were confirmed as genus Melaleuca but were not identifiable any further, 79 were refined to species by Brendan, and 4 were cases where the observer had identified the record to species X, another identifier had added an ID of species Y (pushing the record back to genus), and Brendan confirmed species Y as the correct ID. Of the 23 observations that were corrected from species X to species Y, 10 involved one of the difficult species groups, in which even herbarium specimens are misidentified or have misapplied names.
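For anyone who wants to check the arithmetic, here is a rough Python sketch that tallies the counts reported above. The variable names and category labels are my own shorthand rather than anything from iNat, and the numbers are simply the ones quoted in this post.

```python
# Counts as quoted in the post; names and labels are illustrative shorthand.
total = 322                # observations Brendan IDed
not_melaleuca = 4          # turned out to belong to a different genus
at_species_before = 220    # already at species before his review
confirmed_species = 175    # species IDs he confirmed as correct

# Everything not already at species was sitting at genus Melaleuca.
genus_only_before = total - at_species_before

genus_only_breakdown = {
    "corrected to a different genus": 4,
    "confirmed as Melaleuca, not identifiable further": 15,
    "refined to species": 79,
    "earlier disagreeing species ID confirmed": 4,
}

# Sanity check: the four categories should account for every genus-level record.
assert sum(genus_only_breakdown.values()) == genus_only_before

genus_accuracy = (total - not_melaleuca) / total
species_confirmation_rate = confirmed_species / at_species_before
not_confirmed = at_species_before - confirmed_species

print(f"Correct at genus level: {genus_accuracy:.1%}")                # 98.8%
print(f"Species IDs confirmed:  {species_confirmation_rate:.1%}")     # 79.5%
print(f"Species IDs changed or pushed back: {not_confirmed}")         # 45
```

These round to the 99% and 80% figures quoted in the text.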
So overall, 80% of the 322 observations were identifiable to species, and before Brendan reviewed the records, 80% of observations already at species were correctly identified. After his review, the observations covered 57 different species, including a few new species for iNat.
These are pretty impressive results given the difficulties involved in identifying this genus that I discussed above (high diversity, lots of sympatry, daunting keys, the importance of fertile material), and certainly these stats are higher than I expected before the event.