Hacker News new | ask | show | jobs
Gemini 2.0 Flash Thinking – Audio Reasoning (aistudio.google.com)
2 points by jaykr_ 501 days ago
1 comments

Seems like Gemini 2.0 Flash Thinking got silently updated in AI Studio to accept audio input, as well as image and text, making it the first reasoning model I'm aware of that works across audio. Trying it out on a few audio tasks (transcription, sound analysis) seems to perform a little better than 2.0 Flash or even Pro. Curious what you guys make of it!