Provide a way to filter observations by disputed IDs

I would like the ability to filter observations by IDs that have been disputed. (Apologies if this has already been proposed; I searched for it but didn’t find anything. There was a discussion about finding your own “maverick” IDs by typing into the URL, but this is a different proposal. If there is a way to type directly into the URL and achieve what I’m proposing here, please let me know, as it would solve this problem.)

Often I am working through higher level taxonomic groups and find disputed IDs that have been kicked up to a high level and need additional IDs to reach genus or species level.

For example, suppose that an observation is (incorrectly) labelled winterberry. It is corrected to honeysuckle. Now if the original observer doesn’t take any action, the observation will sit at dicots (class) until two additional identifiers find it and agree with the correct ID. Sometimes it takes months or even years to get the 3rd identifier.

I think what is happening is that the observation gets buried in large numbers of higher level IDs, some or maybe many of which are just low quality observations (blurry photo, picture of a tree trunk, or a single leaf) or hard to identify past, say, dicots. The key point is that a lot of the disputed IDs would not be that hard to confirm if they could be found easily. Having the ability to filter for these observations would decrease the time that the disputed observations spend in taxonomic limbo.

@matthias55 I changed this to a general topic instead of a feature request, because I’m pretty sure the following will do what you are asking. But if not, we can change it back to a more targeted feature request.

[EDIT: turns out that the following may not be a reliable long-term solution to this request, so converted it back to a Feature Request.]

This will work in Identify mode (not in explore Observations mode), but that is probably where it is best used anyway. If you follow this link:

https://www.inaturalist.org/observations/identify?order_by=updated_at&order=asc&lrank=phylum

You will see the Filters set for

  • lowest rank of observation ID = Phylum (change to whatever rank you like)
  • sorted by date the observation was last updated, oldest first

Then if you manually add

  • &identifications=most_disagree

to the search URL, it will restrict to a much smaller set (I currently see 858 at Phylum) where most of the IDs added were disagreeing IDs.
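For anyone who would rather script this than edit the URL by hand, here is a minimal Python sketch that assembles the same Identify link. The helper name `build_identify_url` is made up for illustration; the parameter names and values are only the ones that appear in the links in this thread, and nothing here checks whether the site still honors them.

```python
from urllib.parse import urlencode

# Base of the Identify page, taken from the link above.
IDENTIFY_URL = "https://www.inaturalist.org/observations/identify"


def build_identify_url(lrank="phylum", identifications=None, **extra):
    """Build an Identify URL sorted by last update, oldest first.

    lrank           -- lowest rank of the observation ID (e.g. "phylum", "class")
    identifications -- optional value for the manually added parameter,
                       e.g. "most_disagree"; left out of the URL when None
    extra           -- any other query parameters, passed through unchanged
    """
    params = {"order_by": "updated_at", "order": "asc", "lrank": lrank}
    params.update(extra)
    if identifications is not None:
        params["identifications"] = identifications
    return IDENTIFY_URL + "?" + urlencode(params)


url = build_identify_url(identifications="most_disagree")
print(url)
```

Extra filters can be passed as keyword arguments, for example `build_identify_url(lrank="class", identifications="most_disagree", iconic_taxa="Plantae", place_id=42)`.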

I think that is the kind of search you are looking for, but again, if not, we can discuss a more targeted feature request. I know the filter interfaces are being reviewed for possible improvements, so maybe this is one that will become a regular filter option.

The other available options for this parameter are

  • &identifications=some_disagree
  • &identifications=most_agree

Those didn’t seem to have much effect for this particular use case.

I would give this two hearts if I could. Can you add it to the search wiki? I would do it but I don’t know what the difference between “some disagree” and “most disagree” is. Testing it out on plants, “some disagree” seems just to give me items where there is only one ID.

We can add it, but I have to confess I am not sure about the differences either, or whether this functionality will even be continued, after reading this old topic. I just know it seemed to work for this particular use case.

Thank you very much!

Hmm, I think I replied thank you too soon.
I tried this and it doesn’t seem to capture most of the disputed IDs.
E.g. with this link:
https://www.inaturalist.org/observations/identify?iconic_taxa=Plantae&order_by=updated_at&order=asc&lrank=class&place_id=42&identifications=most_disagree

I only get 9 results.

If I take off the “&identifications=most_disagree” there are 7729 results.
I would guess that all the “disputed” IDs would yield at least hundreds of results.

Even if I go down to family level (rank high = family), there are only 15 hits in all of Plantae. There should be thousands here.

@matthias55 Yeah, as mentioned in the old topic I linked to above, it turns out that the most_disagree functionality is based on an outdated model of identifications in iNaturalist, and may soon be discontinued altogether. So I will go ahead and convert this topic back to a Feature Request.

A couple of questions:

I notice that your link restricts the search to just Pennsylvania. I assume that you are expecting to see higher numbers just within Pennsylvania?

Can you post a sampling of links to observations that you think the search should be capturing but is not?

Thanks!

Hmm, I’ve worked PA, OH, IN pretty heavily. I would expect all these states to have similar levels of disputed IDs because they have enough folks going through the flora observations and catching mistakes.
I would posit that it would hold across the US and Canada and really I would expect that you’ll find this level of disputed IDs in any taxonomic group where someone is working to catch errors.

Some examples of what I would like the filter to catch:
https://www.inaturalist.org/observations/7055140
https://www.inaturalist.org/observations/7066901
https://www.inaturalist.org/observations/7045167
https://www.inaturalist.org/observations/7078137
https://www.inaturalist.org/observations/7230415

These are all situations where an initial ID is provided and then disputed by one or more identifiers, keeping the community ID at a higher level until enough additional identifiers find it.

Here’s one that’s slightly different:
https://www.inaturalist.org/observations/7399915

The identifier disputed the initial ID but didn’t propose a low-level ID, i.e. “it’s not what you said, but I don’t know what it is.”

I would like the filter to catch this as well.

Here’s one that got stuck at family level (and has been stuck for 3 years, though it’s easily IDed as glossy buckthorn):
https://www.inaturalist.org/observations/762205

Thank you!

Sorry, I wasn’t very clear with my first question. Is the number of disputed ID observations returned by the search a lot lower than you were expecting to see within Pennsylvania?

I think you are saying yes – I just wanted to make sure it wasn’t a result of having inadvertently limited the search to just that state.

Yes, adding “&identifications=most_disagree” yields far fewer results than a query that would find every case where someone disagreed with an ID. It’s also not just PA:
E.g. I did the same search for California plants at the phylum level.
32K results without “&identifications=most_disagree,” 50 with that extra search term.

If I go to class, it is 70K without, 118 with.

We’re talking maybe 2 orders of magnitude difference between the number of results expected and the number returned.
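As a rough check on the California counts above, here is a small Python sketch that compares the unfiltered and filtered result counts. Note the assumption: it measures the gap between the unfiltered total and the filtered total, which is only an upper bound on the shortfall, since not every unfiltered observation is a disputed one. The "32K" and "70K" figures are treated as 32,000 and 70,000.

```python
import math

# (unfiltered results, results with identifications=most_disagree),
# using the rounded California plant counts quoted above.
counts = {
    "phylum": (32_000, 50),
    "class": (70_000, 118),
}


def orders_of_magnitude(unfiltered, filtered):
    """Return log10 of the ratio between the two result counts."""
    return math.log10(unfiltered / filtered)


for rank, (unfiltered, filtered) in counts.items():
    ratio = unfiltered / filtered
    gap = orders_of_magnitude(unfiltered, filtered)
    print(f"{rank}: {ratio:.0f}x fewer results, about {gap:.1f} orders of magnitude")
```

So the filter keeps well under 1% of the observations at either rank, which is consistent with a gap of at least two orders of magnitude if even a modest share of those observations are disputed.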

I have briefly looked for a pattern in the 50 results that were returned and I can’t find anything obvious. The dates range from 5 years ago to as recent as a month ago, so it’s not as if only the older observations are being returned.
