Something needs to be done about long audio recordings

In the age of the Merlin Bird ID app, it is very easy for observers to directly upload a lengthy audio file that supposedly contains a certain bird species. In many cases, observers will upload the same minutes-long recording multiple times to create an observation for each of the species present (or each of the species suggested by Merlin).

As a frequent identifier of bird sounds on iNat, this is excruciating. It is common for the indicated species to not actually be in the recording. Or, with no indication of when the indicated species occurs, it can be difficult to pick out a single call note in the background of a 60 second audio recording. As a result, it’s clear that some identifiers just skip past these recordings. Other times, someone disagrees with the observer’s ID simply because the first or loudest bird is something else entirely.

It would be great to have some sort of warning pop up when someone attempts to upload a 30+ second recording that suggests they either (a) trim the recording to focus on the species indicated in the observation or (b) include a time stamp in the Notes (Merlin makes it easy to find the time stamps for a suggested species, even if you don’t know the ID).

How do you handle these? Are you too tempted to skip these longer recordings, especially when you see it’s been uploaded 6 times for different species?

(I’m not posting this in the Feature Request forum for now, as I’d love to hear more about what other folks think about this problem and how they approach it.)

I haven’t noticed too many long recordings. What do you consider too long? I do upload the occasional Merlin recording but I always keep them fairly short (~30 secs). I think I may have a couple longer than that but the bird was doing something cool in those. Also, I only upload ones where the subject bird is calling in the first 10 secs of the recording.

Lately, I’ve seen quite a few in the 2–3 min range where the species indicated only calls once or twice. Personally, once a recording gets past 30 seconds, I’m less likely to listen through the whole thing if the bird doesn’t call in the first few seconds.

While I don’t usually bother to post audio recordings of birds I’ve been able to take pictures of, If I can hear one using merlin but can’t see it and its loud/clear enough to reliably identify I typically add a screenshot of the spectrogram for that bird’s call along with the audio

Perhaps you could suggest free open source software to users with the capability to crop / splice / edit audio? - https://www.fosshub.com/Audio-Editors.html

I rarely upload bird audio but when I do I cut out useless noise (wind, silence, unrelated calls). Can’t recall the name of the program I use

Are you certain these are coming from Merlin? I’m not doubting you, I’m just wondering if that’s an assumption or not.

For what it’s worth, I don’t consider an audio observation to be ā€œlongā€ until the total length of time is over two minutes. Of course I’d prefer a cropped audio clip, but many users won’t know what is or isn’t relevant – or, they may not know how to edit audio. If it became an issue in my region I’d politely reach out to the user(s) who cause the problem.

I’m much more frustrated with recordings that haven’t normalized the sound, because otherwise I can’t hear the bird on my laptop’s speakers, and it’s a pain to get out headphones.

That said, no, I’m not interested in playing where’s-the-bird-you’re-asking-about, whether it’s in an uncropped photo or an uncropped audio file. I do enough of that with my own bad photos and recordings…

Sometimes they include screenshots! But, nowadays, nearly everyone in North America who is recording bird sounds on their phone is using Merlin, so it’s a pretty safe guess. (Of course, if they aren’t using Merlin, then I assume they know which sound on the recording is the bird they are identifying, and see above.)

I’ve assumed most are using the iNat app to record the bird songs. That’s what I’ve done. And if the bird hasn’t given a decent vocalization within 30 seconds I discard that recording and try again.

If this is true I feel very old. I’m still over here taking voice memos like it’s 2010. Never touched these newfangled bird apps. lol

I came across several 5 minute recordings a year or so ago. I asked the uploader to say where in the recording the sounds were that they were interested in, because 5 minutes was too long to expect anyone to listen to. They said it was part of some frog census, and they had to do 5 minute recordings. From what I saw none of their observations were given IDs beyond ā€œFrogs and Toadsā€. Whatever the recordings were for, iNat wasn’t the place for them.

The thing that made me switch is how useful it is to see the spectrogram on the screen, very nearly in real time. There are some bird sounds that I struggle to distinguish by ear, but seeing the shape on the screen helps me to hear the difference.

https://www.inaturalist.org/observations/344859671
This is almost three minutes long; I heard a bird and walked toward it, with the recorder running, in case I frightened it away. I provided timestamps when someone asked, but it still hasn’t been identified.

Most of my recordings are shorter than a minute, but sometimes I hear a bird and start the recorder, but the bird doesn’t say anything for a minute.

I don’t have Merlin. I do have Audacity, which has helped me to measure the pitch of a bird, but it doesn’t tell me what kind of bird it is.

But Audacity would let you trim the parts where the bird doesn’t say anything.

With my own recordings I consider a ā€œshortā€ recording less than 10-15 seconds (nocturnal flight call recordings often get cropped to only a couple seconds) and get bored of listening if they go beyond 25-30 seconds. I listen to bird recordings regularly for work, and 3 minutes feels like a very long time to be sitting inside and listening to a single soundscape. For the purposes of iNaturalist I feel like that long is excessive and multiple <30 second recordings would be preferred by anyone. Even in 30 seconds there can often be enough species vocalizing that it’s unclear which is the target.

But on the other hand, it does help a lot to hear repetitions of a vocalization because it often happens that a single vocalization is unusual and not representative of what the species (or the individual) normally sounds like.

I downloaded your file and used Audacity to clean it up. I don’t know what bird it is though. :woman_shrugging:

The tool ā€œiNaturalist Sound Classifierā€ with ā€œBirdNet v 2.4ā€ suggested that this could be Melospiza melodia (90.0 to 100.0s) or Parus major (57.0 - 60.0s). ā€œPerch V2ā€ also suggests Melospiza melodia

No clue if that’s correct

Good to get this feedback. I’ll be more aggressive about shortening my recordings (and will continue to add time marks in the Notes.)

FYI for folks new to recording bird songs, I use ocenaudio to edit my recordings. It’s one of the ones recommended by Cornell, and it’s fairly easy to use. https://www.macaulaylibrary.org/resources/audio-editing-tutorials/prepare-sound-files-in-ocenaudio/

I am not a bird person, but I will try and get audios. I have trouble processing sound and I often don’t know which bird sounds I have ā€˜captured’. If my husband has the time I ask him to tell me which is which so that I can put it in the notes but often I have to rely on the iNats community.

I hadn’t heard of Merlin and was unaware being able to edit the audio once uploaded - all my bird recordings are done through the iNats app.

A frog researcher told me he needs recordings longer than 10 secs (sth Aust)

Oh, good; I’m not the only one, then. We can have our own little club. I brought cake. :laughing: :moon_cake: