Hacker News new | ask | show | jobs
by jameszhao00 1068 days ago
By “SOTA” tts I think you mean LLM based TTS? With sound and language tokens trained GPT style?

Without going into too much details, imo they’re not really usable right now for TTS use cases.

1 comments

Not necessarily LLM style. The above isn't for instance.

also Google Studio Voices is excellent. Definitely better than Microsoft's best, albeit very limited voices.