Screenshot from a bird voice identification app as a photo

Yes, I don’t disagree with any of this. I do hope iNat will eventually implement automatic spectrograms (I think they haven’t yet because of logistical hurdles, not because they don’t want to).

To an extent this is true too, but not exactly. In an image, the proportions of the object to ID stay the same no matter how you photograph it. That is not true when you change a spectrogram’s scale, because the x-axis need not change along with the y-axis. I talked about that in more depth in the linked thread. However, you are correct that many other aspects of photos are not standardized.
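To make the proportions point concrete, here is a minimal sketch in Python. The note (a 2 kHz rise over 0.5 s) and the pixel scales are made-up numbers for illustration, and `rendered_slope` is a hypothetical helper, not something from any app.

```python
def rendered_slope(sweep_khz, duration_s, px_per_khz, px_per_s):
    """Slope (vertical px per horizontal px) of a frequency sweep as drawn."""
    height_px = sweep_khz * px_per_khz  # how tall the sweep is on screen
    width_px = duration_s * px_per_s    # how wide the sweep is on screen
    return height_px / width_px


# Hypothetical note: rises 2 kHz over 0.5 s.
# Same y-axis scale in both renderings, but the x-axis is stretched in the second.
slope_a = rendered_slope(2.0, 0.5, px_per_khz=50, px_per_s=100)
slope_b = rendered_slope(2.0, 0.5, px_per_khz=50, px_per_s=200)

print(slope_a, slope_b)  # 2.0 1.0
```

The same sound is drawn twice as steep in the first rendering as in the second, which is the kind of change in shape that zooming or cropping a photo does not produce.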

This is unlikely without standardized spectrogram scales (not to mention many other display features, such as the color scale). I build automated audio recognizers for my work, and the first step is always to standardize the spectrogram (the same is done for other existing bird song recognizers, e.g., Merlin).
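As one small example of what standardizing a display feature can look like, here is a sketch in Python that pins the color scale to a fixed dB range. This is not my actual pipeline or Merlin's; `normalize_db` is a hypothetical helper, and the -80 dB floor and 0 dB ceiling are arbitrary assumptions.

```python
def normalize_db(db_values, floor_db=-80.0, ceil_db=0.0):
    """Clip dB values to a fixed range and rescale them to 0..1."""
    span = ceil_db - floor_db
    return [(min(max(v, floor_db), ceil_db) - floor_db) / span for v in db_values]


# One hypothetical spectrogram column, in dB relative to full scale.
column = [-120.0, -80.0, -40.0, 0.0, 6.0]

print(normalize_db(column))  # [0.0, 0.0, 0.5, 1.0, 1.0]
```

With the range fixed, the same energy level always maps to the same color value regardless of which app or recording produced the spectrogram. The time and frequency axes would need the same treatment.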
