The analysis of location names (aka locality notes) is quite confusing in the video. Initially, it’s included as the ninth item in the list of poor quality data, but the rest of the video only lists eight items (e.g. in the data quality concerns ranking). Towards the end of the video, there is a brief analysis of location names, but it doesn’t really address the essential problem with this item of iNaturalist data. This is rather disappointing, because many verifiers have complained a lot about this issue over the years, and there’s really no question that these complaints are generally well-founded, as can be inferred from this iNaturalist help topic: why is my observation’s location name incorrect? The fundamental problem here is that most locality notes are derived directly from the coordinates via reverse geocoding, so they have no value at all when it comes to verification - i.e. if the coordinates are wrong, then any automatically added locality notes must also be wrong, no matter how superficially accurate they appear to be. The special case of “vague” location names is really a red herring. What really matters is that, by default, most iNaturalist location notes aren’t independent items of data, so they can’t (or shouldn’t) be used for the verification of coordinates.
Now of course, if we could somehow guarantee that the coordinates always came from a reliable source, this wouldn’t be so much of a problem. But there are many different ways in which the primary location data can be entered (manually, via GPS, from geocoding, photo metadata, etc), and iNaturalist doesn’t currently record its origin. Thus, since this data has no obvious provenance, and is subject to various errors (both human and mechanical), it’s vital that an independent means of checking its validity is provided. This is why it’s so important for the recorder to manually enter their own location name, and that they choose something which doesn’t just look like a normalised address produced by a machine. Ideally, it should be a brief description based on a few local points of interest which demonstrates the recorder’s familiarity with the observation site. For example, “Field at end of footpath by SomeVillage church” is much better than “SomeVillage, SomeTown, England GB XY12 Z34”, because it uses features that can be easily found on the sort of maps provided by e.g. the Ordinance Survey or OpenStreetMap (as opposed to, say, Google, which is much more business-oriented). Needless to say, adding such information for a large number of records will be quite time-consuming as it often requires quite careful thought. But that’s precisely the point: if there’s no pain, there’s no gain.
This issue (amongst many others related to iNaturalist) was discussed several years ago on the National Forum for Biological Recording. However, since many people don’t do facebook, I will reproduce some of the relevant parts here:
Natalie Ann Harmsworth: My bug-bear so far has been with location names being auto generated. So far at least, many of the locality names used bear no close relation to the spatial references given. Is there a way to flag which location names are recorder inputted vs those that are auto generated? At present instead of the location names giving context to sightings they are confusing and make you think that there must be a problem with the record (grid reference and location name do not match). I am guessing this is going to [be] more of a problem in more rural locations…
Roger Morris: I agree wholeheartedly - we use location names as part of the location verification process - if that name is miles away then it creates big problems.
Natalie Ann Harmsworth: Quite! Because there is no direct link between iRecord and iNaturalist any queries regarding locations would have to be posted at source on iNaturalist. Practical if you only have a few to do, but if you are verifying 100s or 1000s of records I cannot see people taking the extra time to do this. I started off rejecting records if the location and grid clearly were are odds with one another, but that results in a lot of ‘lost’ records so will investigate if commenting on iNaturalist is worthwhile. In effect it isn’t the recorder’s fault if iNaturalist adds an incorrect location name - it is the system that is at fault and the fact iNaturalist does [not] consider location names so important.
Automatically adding reverse-geocoded location names seems quite appealing at first, but in the vast majority of cases, it either adds no value whatsoever or is actively misleading. If the accuracy circle is, say, 50 metres, what is the point of giving a superficially accurate location name several kilometres away? This sort of cosmetic fluff is just an attractive nuisance.
These issues aren’t restricted to iRecord. When I’m reviewing observations on iNaturalist, I don’t just want to make identifications: I would like to verify the location data as well. However, the site doesn’t really offer any help with this, as there’s no concrete mechanism, nor any agreed conventions regarding how to deal with inadequate/ambiguous location data. There seems to be a common assuption that the coordinates are somehow self-evidently correct in most cases, and so don’t deserve the same level of scrutiny that identifications attract.