by idonotknowwhy 246 days ago
Qwen3 omni transcriber can do this. It describes the voice and emotion very well.

I've also had luck with Gemini. When I made a few noises and asked which one was higher-pitched, it could easily tell.