Why not empower recognised experts?

sbushes · July 16, 2020, 10:53am

Ok, wow. Well you are lucky you have two sources in botany!

Lets take other families in Diptera.
Of the other 100 or whatever families in the UK, hoverflies is the only one with a field guide.
Most of the rest of the information is scattered, difficult to find or access, let alone fully understand as an amateur. Keys might not exist, might be inaccessible or out of date in comparison with the knowledge the specialist has. Most of the time, there isn’t a simple way to double check… unless you are one of the specialists…or you want to put ludicrous amounts of research into each ID.

This leaves me with the choice mentioned in the other post

leave ID to languish
or try and proportionately increase accuracy of AI and dataset for everyones benefit

In UK Diptera, I certainly wouldn’t be able to trust any range data provided by iNat, without first checking to see if the data had been verified…by one of the experts…the IDs also most likely wouldnt be included in the range data(?), if someone didn’t blindly second the expert in the first place…

astra_the_dragon · July 16, 2020, 10:59am

I do see your point, though I still disagree. I wish that these experts would collate and publish resources for others to use and verify. I think it would solve a lot of problems if more information were available to the public, and not only locked up in the brains and ivory towers of academics.

botanicaltreasures · July 16, 2020, 12:51pm

I can identify with feeling frustrated on the lack of easily accessible, accurate info about tachinids. My goal is to improve the existing observations in iNat how I can by annotating the tachinid eggs and pupae I come across for life stage. Getting a species ID can be like finding Eldorado. However, experts from other countries can often get adult Tachinids to the correct family if not genus.

nathantaylor · July 16, 2020, 3:24pm

If the ability to download the identifications of individual users were possible, this still might be an issue, but there would be a really good workaround. As is, it is essentially impossible to separate expert IDs from novice IDs. I would like to see the ability to sort by the name of an identifier (annotations for specimens too), but I wouldn’t know who to contact.

I agree with @aspidoscelis, but think that if the ability to download user identifications were implemented, there wouldn’t be as much need to get violently upset about differences of opinion in taxonomy. There’s still the potential for hostility, but at least it’s not the end all, be all if you want the data. In those situations, it’s of coarse better if a single name can be reached so that an observation can reach “research grade” (I like the term “community reviewed” better).

Ultimately, I view this as a patch to the root problems which need to be worked on more, but it is a patch that would at least make data quality sorting possible. It’s a patch I’ve wanted for well over a year now (probably a few years now actually) and I think its something that iNaturalist desperately needs for researchers as a whole to see it as a valuable tool.

sedgequeen · July 16, 2020, 5:41pm

We know taxonomy isn’t simple. Our minds was either/or answers, but nature has no interesting in making us happy. Therefore, I have made ways to deal with taxonomic difficulties.

Is the Blackberry That Ate the Pacific Northwest Rubus armeniacus or R. bifrons? A lot of people are certain, but the answer depends on your taxonomic opinion of the relationship between these names. So I just “agree” to either one. If there are conflicts or questions I explain, but I figure researchers will have to look up both names if they want all the records.

In Sedum section Gormania, the taxonomy has been seriously revised and iNaturalist isn’t rushing to change. (The Jepson treatment is in my computer now for editing and may be posted soon; then iNaturalist will update, I hope.) So some Sedum I don’t identify at all. Some I identify by the current iNaturalist name and write a note saying what its new name is. Some I identify as “Sedum” and briefly explain the problem. Probably frustrating for the photographer and certainly frustrating for me, but taxonomy is like that.

So, there are, I think, ways to use the current system to express taxonomic disagreements in iNaturalist.

sedgequeen · July 16, 2020, 5:43pm

I switched to plants mainly because I love doing taxonomy but changes in North American bird taxonomy had come down to “Do we split or lump this group based on variation we all know about?” The plant questions run deeper.

sbushes · July 16, 2020, 5:58pm

To be clear, my thoughts around “disempowering” newcomers were really with those who literally just signed up. They have less than 100 obs in a taxa… and no ID experience… and start adding 10s or 100s of species level IDs in taxa where its not possible, without realising the time it can take others to correct, or the impact this might have on the AI / dataset.

So when I said disempowering… I meant more limiting their powers until they understand the power they are being attributed.

The forum has basic levels of trust…why wouldn’t we have that with identification?

fffffffff · July 16, 2020, 6:11pm

That way some real experts will never be at the same “level” as other users, there’re many of them who don’t upload any observations at all.

sbushes · July 16, 2020, 6:26pm

I’m not saying they need to upload observations per se…

I’m literally just on about an implementation akin to the forum intro.
Even just the smallest intervention possible to explain the basics to users on arrival.
A tutorial, a note, a popup, an email. How does it work with new users at present?

As it notes on the Discourse link mentioned by @bouteloua in the other thread, this is about :-
“Sandboxing new users in your community so that they cannot accidentally hurt themselves, or other users while they are learning what to do.”

fffffffff · July 16, 2020, 6:34pm

I agree, more and more popups and tutorials, probably more is needed not to get new users separated in any way, but to make clear for them what iNat is about, probably having each new member reead through tutorial, more complicated than one we have today for the app, with links to forum and popular questions.

sedgequeen · July 16, 2020, 6:50pm

The great thing about iNaturalist data isn’t that it’s always correctly identified (it may not be) but that (1) there’s lots of it and (2) it’s verifiable. It may or may not be right, but if you need to know, you can find out. That’s really valuable.

bouteloua · July 16, 2020, 6:57pm

You can see what new app users view here https://forum.inaturalist.org/t/guiding-new-users-without-scaring-them-off/2242/5?u=bouteloua

sbushes · July 16, 2020, 8:03pm

Ok, good to see, thanks!
Its a bit difficult for me to imagine, as I don’t use the app. Will have to download when I get my phone fixed :) I know trying to help my mum to use it remotely this last month, she has struggled a bit - especially when I tried to explain about things like withdrawing an ID in order to let a new ID take precedence.

I think withdrawing/agreeing is one of the crucial aspects to help explain, as people either leave their original ID incorrect without realising the impact, or they blindly agree without knowledge of ID or of identifier ( new users might even expect identifiers to be experts… without realising how iNaturalist works ).

A simple solution to this though could also just be having a withdraw button visible on ID itself in same way agree button is. Even then though, my mum struggles to understand the bigger picture.

I’m really enjoying seeing the pointers in the forum these last days - popups telling me not to limit conversation to only one person… not to post too many times in succession, etc…
I think this is the kind of thing that could really help outside the forums with guiding new users.

I can’t imagine many website users clicking through the links on the email.

bouteloua · July 16, 2020, 8:19pm

This is what new web users view when they first log in. I’ve updated the link I sent above with this screenshot:

I agree onboarding could be a bit more hand-holdy. Feel free to submit some ideas as feature requests.

tiwane · July 16, 2020, 10:08pm

As has been stated before, this issue’s been discussed quite a few times throughout the course of iNat’s existence, and as bouteloua quoted me earlier, any possible “expert” rating would be based on iNat activity, not external factors.

I think better onboarding (we’re just starting to draw up some ideas now, I know it’s been a long time coming), disincentivizing unwanted behavior (eg blind agreeing), allowing to filter by identifier (as @nathantaylor suggested, I know it’s been a long time request), and other fixes can solve or mitigate a lot of the issues raised here.

I can’t speak for everyone, of course, but here’s what I’ve heard from two top identifiers on iNat what I’ve met who each focus on one difficult taxonomic group:

one expert has told me one motivation is that it’s an incredible way for them to practice and learn because they’re seeing photos of varying quality from all over the world of their taxon of interest.
another told me they really just like helping people and if they can give their time and expertise in a way that helps people learn more about what they see, it makes them happy and they believe it’s just a good thing to do.

Some others who I’ve talked to are motivated by generating the data they want and they understand it often takes outreach, humility, and patience to teach and empower people to get that data, make the right observations, and identify taxa. I understand not everyone has the skills or resources for that, but it’s possible, and benefits many members of the community.

I’m not an expert by any means, but I’m pretty good with some bits of California flora and fauna, and I just want to help people who are curious about what they saw. Maybe they won’t misID a spider or a snake and kill it next time see it, or maybe they’ll just be able to point out a flower to a friend the next time they’re on a hike. Whether that observation ever gets to research grade is beyond my power, and it’s not something I care about. And if it sounds like I have no ego involved here, that’s not the case because I still feel quite a sting if an ID of mine is corrected. But that fades quickly, and it’s a chance to learn both about the taxon in question and how I can improve myself.

I can’t find the exact words above, but I feel like there might also be a misunderstanding about the computer vision training set. We now train on ranks higher than species, so please don’t feel obligated to ID to species for the model. From the blog post about our last model:

For the first three models, we only trained them to recognize species. For the last two models, we’ve been able to train with coarser taxonomic ranks. For example, if each species in a genus has 10 photos, that might not be enough data to justify training the model to recognize any of those species, but if there are 10 species in the genus, that’s 100 photos, so we can now train the model to recognize the genus, even if it can’t recognize individual species in that genus. This approach allows the model to make more accurate suggestions for photos of organisms that are difficult (or impossible) to identify to species but are easy to identify to a higher rank

roomthily · July 16, 2020, 11:26pm

i agree with @tiwane about the identifier motivations - i don’t think you can flatten those into one idea of expertise. (i am not a trained expert in the contexts listed here but do rank pretty high up the leaderboard.) i respect both rationales he provided for myself. i’ll add two things in addition:

trained experts are in limited supply and i don’t think it’s sustainable to expect either long-term interaction or broad interaction here. and that’s considering other more specific projects, like bugguide, where those experts might participate. (as an aside, the species (any level) pages and curation are no different from field guides, etc, as references to expertise and more accessible to non-experts so it’s wild to me that those aren’t used more often to clarify an id with an in situ image more like what an observer has shared.) it highlights the need for that intermediate level of identifier that can get you to beardtongues which a) gives the observer a name to research if they’re inclined and b) better filtering for an expert/researcher to add more. it’s a complicated rubric where (again as a not-expert) i’d like to improve my own skills for when i’m out in the world but nudge the other observer to take that next step for their own id and knowledge while also being conservative knowing that either some things aren’t really identifiable from photos alone or have small differences i am not comfortable putting at a species level because folks are quick to accept and especially quick to accept if you’re on the little leaderboard.
sticking with the idea that one of the main reasons for inaturalist is connecting people to nature as an educational tool, i feel like the identification side is overlooked in that. it’s come up for me with the bioblitzes where, at least in my experiences, the feel i got was identification needed to be from a credentialed expert and i found it disempowering as someone interested in learning more deeply about my local area. like i could be told what it was but i couldn’t truly know myself. i also think that hurts when recruiting identifiers and there’s so. much. stuff. and not enough identfiers. ymmv but i think there’s a pretty good case to be made for identifying as a gateway into observing more kinds of things and that seems good for inaturalist.

anyway, i think the “follow->this observation” is not used enough and could benefit from putting those updates under “following” or flagging them in the notification stream as a starter step to new identifiers. have a hunch that some of the likes and some of the identifications are more about wanting to keep track of an observation when maybe you’d rather not have a public opinion. i also don’t mind being wrong in part because there is that “misidentifications” section so i figure something is learning from that mistake . (do people use that?)

i’m also not sure expertise really hedges against some of the less-than-good-faith issues on ids. there’s an uncommon but persistent pattern where someone will start their observation off with an impossible id and, when that’s contested, the poster will switch to a different rare possibility. like id’ing a butterfly in kansas as some british species and switching to something limited to the sierra nevadas after a more likely id is added. that’s something about the original poster that i doubt will respond to an expert. so if there’s some technical change to the system, id hope it addresses the underlying issue if possible. as an example.

cheers.

sbushes · July 16, 2020, 11:51pm

Thanks for this @tiwane.
I’ll think about possible feature requests leading from all this, as @bouteloua suggested …

I’ve really appreciated hearing everyone’s thoughts…and could probably continue debating aspects of this for a long time yet :)

I’m really just trying to reflect back the stuff I hear from UK community mainly…it frustrates me when I hear iNaturalist being denigrated or ignored, when for me it seems to be a far superior platform than the other ones the recording schemes currently use. I just wish there was more integration of the UK expertise into the community here. But…perhaps I just need to be patient, also.

While I note it… the point you were referring to was perhaps the response by @upupa-epops …

This comment from Kueda on the blog post helps clarify this, as follows…:

Training data gets divided into three sets:

Training: these are the labeled (i.e. identified) photos the model trains on, and include photos from observations that

have an observation taxon or a community taxon
are not flagged
pass all quality metrics except wild / naturalized (i.e. we include photos from captive obs; note that “quality metrics” are the things you can vote on in the DQA, not aspects of the quality grade like whether or not there’s a date or whether the obs is of a human))

Validation: these photos are used to evaluate the model while it is being trained. These have the same requirements as the Training set except they represent only about 5% of the total

Test: these photos are used to evaluate the model after training, and only include observations with a Community Taxon, i.e. observations that have a higher chance of being accurate b/c more than one person has added an identification based on media evidence

You’ll note that we’re potentially training on dubiously-identified stuff, but we are testing the results against less-dubious stuff (you can see what these results look like in the “Model Accuracy” section of https://forum.inaturalist.org/t/identification-quality-on-inaturalist/7507). The results are, strangely, not so bad. Ways we might train on less-dubious stuff (say, CID’d obs only, ignore all vision-based IDs, ignore IDs by new users, ignore IDs by users with X maverick IDs) all come with tradeoffs and all, ultimately, limit the amount of training data, which I’m guessing would be a bad thing at this point for the bulk of taxa for which we have limited photos.

I’m not sure if I’ve read this detail before… I certainly seem to constantly forget aspects of it at least! Some more things to stick in an FAQ somewhere perhaps?

sbushes · July 17, 2020, 1:43am

Sounds good! We are lucky enough to have Chris Raper for UK Tachinidae… I think he covers European IDs too, but maybe not beyond that…

jhbratton · July 17, 2020, 10:23am

Not replying to anyone in particular, I just want to stand up for amateurs. Several people have used “amateur” as the opposite of “expert”. Amateur is the opposite of professional. Neither term tells you anything about their level of expertise.

Many experts are amateurs. One benefit of being an amateur is you don’t have targets and deadlines so can put as much time as you want in to nibbling away at an area of study.

botanicaltreasures · July 17, 2020, 12:16pm

Yes, we are. Chris Rap @chrisrap identifies in more places than just Europe. Also I am inspired by the dedication of Arturo Santos @aispinsects here in the US. There are others who I’m sure deserve a mention as well.

Topic		Replies	Views
Weighted Identification General	8	854	August 17, 2019
Recruiting more identifiers General	287	24448	December 21, 2019
The benefits and drawbacks of adding coarse identifications General	175	1420	August 24, 2024
Limit the Power to Convert "Needs ID" obs to "Research Grade" Feature Requests	29	3379	July 26, 2021
Thank you to the experts General	24	1432	December 4, 2022

Why not empower recognised experts?

Related topics