Hacker News new | ask | show | jobs
by GaggiX 150 days ago
I love that everyone is making their own TTS model as they are not as expensive as many other models to train. Also there are plenty of different architecture.

Another recent example: https://github.com/supertone-inc/supertonic

4 comments

In-browser demo of Supertonic with WASM:

https://huggingface.co/spaces/Supertone/supertonic-2

Another one is Soprano-1.1.

It seems like it is being trained by one person, and it is surprisingly natural for such a small model.

I remember when TTS always meant the most robotic, barely comprehensible voices.

https://www.reddit.com/r/LocalLLaMA/comments/1qcusnt/soprano...

https://huggingface.co/ekwek/Soprano-1.1-80M

Thank you. Very good suggestion with code available and bindings for so many languages.
Thanks for heads up, this looks really interesting and claimed speed is nuts..