Hacker News new | ask | show | jobs
by jaykr_ 501 days ago
Seems like Gemini 2.0 Flash Thinking got silently updated in AI Studio to accept audio input, as well as image and text, making it the first reasoning model I'm aware of that works across audio. Trying it out on a few audio tasks (transcription, sound analysis) seems to perform a little better than 2.0 Flash or even Pro. Curious what you guys make of it!