Audio and CV suggestions

swampster · March 13, 2024, 6:15pm

No, it does not.

Many things have to happen first before this is possible. Audio recognition often uses similar models as image recognition, where the computer is trained on spectrogram images of the audio rather than “hearing” the audio. So the first step would be to have iNat generate a standardized spectrogram (proposed here; discussion of logistical hurtles can be read about in that thread). Then some form of moving-window recognizer has to be made because it doesn’t really work to have the computer look at the entire audio clip (it needs to take it segment by overlapping segment).

Topic		Replies	Views
Recognize sounds automatically Feature Requests	13	11667	June 2, 2020
AI sound identification? General	10	4398	August 19, 2023
Suggest ID for sounds? General	17	4004	October 19, 2021
Audio recordings of bird song/calls General	54	4824	May 9, 2023
Audio samples in the "Compare" feature of ID'ing Feature Requests under-review	10	542	October 6, 2024

Audio and CV suggestions

Related topics