Hacker News new | ask | show | jobs
by ttt11199907 925 days ago
I've always thought that these apps are missing a feature-set: Manipulation of the sound clip prior to identification. In both this app and BirdNET the selection always selects from the lowest recorded frequency to the highest -- I would like to be able to select the frequency band I'm interested in. Maybe first select time, then put corners on the rectangle to be shaped. The sound file passed to the identifier would appear cropped such that the unselected region is totally silent to the identifier. I am often trying to identify an owl or other lower-registered bird but it is unidentifiable because of noisy crickets/cicadas.
2 comments

It would be useful to be able to change the spectrogram parameters to achieve what you’re suggesting, but it’s likely that this might cause the model to misbehave if it was trained only on spectrograms of audio samples that weren’t frequency-band limited.
There are applications that have features like this (for example ArtemiS by HEAD Acoustics). I have long wanted an open-source solution for this.