Question About Audio Observations

I’m pretty sure Merlin is computervision, it’s been trained on spectrograms and not actual audio files. I remember years ago when we discussed training a model for audio, the inclusion of time in audio makes things much more complicated.