| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by abel_ 1623 days ago

An interesting reflection is how quickly research around TTS/STT has progressed. I remember reading [0] thinking we were a long ways away. And things will get way better with multi-task learning and multi-modal learning in the coming years (or months really).

In fact, just a year after this post was written, CoquiAI started their open source projects [1].

[0] https://news.ycombinator.com/item?id=22869365 (https://thegradient.pub/towards-an-imagenet-moment-for-speec...)

[1] https://star-history.com/#coqui-ai/TTS&coqui-ai/STT