Hacker News new | ask | show | jobs
by radeeyate 501 days ago
I feel that the audio interpreting aspects of the Gemini models aren't just STT. If you give it something like a song, it can give you information about it.