by GaggiX 460 days ago
He was talking about STT models, not TTS. Whisper is open source and a good solution in many cases (fine-tuned versions in particular).
1 comment

Regarding STT, we also got two new models from Nvidia today:

https://huggingface.co/nvidia/canary-180m-flash

https://huggingface.co/nvidia/canary-1b-flash

Second on the Open ASR leaderboard: https://huggingface.co/spaces/hf-audio/open_asr_leaderboard

Sadly, they only support 4 languages (English, German, Spanish, French).