Should an observation's initial ID count the same as subsequent IDs?

joe_fish · June 20, 2019, 3:46pm

I feel like the initial ID should count for less when subsequent identifications from multiple users argue against it. This happens ALL THE TIME, and it can be an enormous task to correct these simple mistakes in taxa that are poorly curated on here.

sedgequeen · June 20, 2019, 6:40pm

“The threshold is >2/3, this one is at 2/3”

So in rarely identified taxa, like grasses, the standard is (in practice, not in theory) 3/4. Unrealistic, but at least I know what’s going on.

Obviously, I personally would like the standard to be 2/3 or more, not more than 2/3. Sigh.

loarie · June 20, 2019, 6:47pm

the reason we set this threshold to >2/3 was so that in the very common sequence of IDs:

user a: incorrect id of species 1
user b: incorrect id of species 1
user c: correct id of species 2

the observation would be rolled back to the common ancestor of species 1 and species 2 as opposed to being research grade at species 1

joe_fish · June 20, 2019, 7:14pm

But perhaps you can have an exception for…

user a: incorrect
user b: correct
user c: correct

where user a doesn’t count against the threshold needed to reach research grade

joe_fish · June 20, 2019, 7:18pm

This is also germane to the topic I brought up recently:

https://forum.inaturalist.org/t/users-opting-out-of-community-ids-can-lead-to-inaccurate-data-points/4315/5

…wherein user A (who had opted out of community ID) and user B had misidentified and users C-F provided the correct ID. Since this wasn’t above 2/3, user A’s original misidentification was showing up on the map for the incorrect species and could apparently only be remedied by either recruiting more identifiers or converting some of the misidentifications. Again, for a taxon where few contribute to curating and identifying, this can be burdensome.

cmcheatle · June 20, 2019, 7:21pm

I dont feel that there should ever be an assumption built into the calculation that an identification is wrong, or that it is correct. If it needs to be outvoted so be it.

joe_fish · June 20, 2019, 7:37pm

Statistically, though, it seems far more common for the initial ID to be wrong than for multiple subsequent identifiers to be incorrect. Maybe iNat can put a number to that, but I’d guess that its perhaps a 50:1 ratio, maybe more. So if two identifiers are all that’s needed to reach research grade in the absence of misidentifications, I don’t think that a statistically likely misidentification from the observer or initial identifier should count against the efforts of subsequent identifiers. There’s undoubtedly a large backlog of observations that would benefit from this.

joe_fish · June 20, 2019, 7:41pm

And, of course, if the user responsible for the first ID is in fact correct in their determination, they can always recruit additional users to support their contention and swing the vote their way.

jdmore · June 20, 2019, 8:12pm

Or 2/2, which I encounter a lot.

sedgequeen · June 20, 2019, 9:10pm

@loarie, I can see the problem with observations where observations go wrong, wrong, right.

DianaStuder · June 20, 2019, 10:20pm

That wrong, right, right, RIGHT pattern is making the City Challenge IDs burdensome to clear. Especially if the original wrong never comes back to put it right.

sedgequeen · June 22, 2019, 4:51pm

Maybe count subsequent ID’s as 1.001, and first ID’s as 0.9999 (behind the scenes) so the common wrong, right, right pattern = more than 2/3 right?

kiwifergus · June 22, 2019, 7:00pm

Or tag in another identifier and make it wrong, right, right, right. But if expertise is lacking to the point where you can’t find someone to tag, or if no-one has confidence enough to support your ID (adding weight), then perhaps genus is where it should be sitting anyway!

jdmore · June 24, 2019, 5:40pm

I kinda like that idea. If the first IDer still felt that the subsequent two disagreeing IDs were wrong, they could just override their first ID with the same ID again, with 1.001 weight this time, and then the “regular” ID weighting would be back in play.

kiwifergus · June 24, 2019, 6:43pm

With wrong, right, right, right there is 3 chances for the wrong identifier to receive dialogue as to why it is wrong, but with wrong, right, right there is only 2 opportunities

schoenitz · June 24, 2019, 6:50pm

I’ve seen right, wrong, wrong as well (because I was one of the wrongs). Anything that makes a-priori assumptions about right and wrong is fraught in my opinion. Let the community sort it out. If it takes one more vote, so be it.

tiwane · June 25, 2019, 6:44pm

Moved this from its original thread in Bugs.

Topic		Replies	Views
2/3 identifiers agree, but observation not research grade Bug Reports	1	878	June 20, 2019
What counts towards a person's identification counts? General question	21	3275	July 14, 2019
Is a single additional ID enough to have an observation safely confirmed General	28	1421	February 7, 2024
CID more than 2 thirds algorithm General	20	750	June 8, 2023
How many identifications that agree are enough for an observation? General	28	2837	July 8, 2020

Should an observation's initial ID count the same as subsequent IDs?

Related topics