|
|
|
|
|
by zamadatix
385 days ago
|
|
The (American English) voices are absolutely amazing but the tags for laughs still feel more like an "inserted dedicated laugh section" than a "laugh at this point in speaking" type thing. I.e. it can't seem to reliably know when to giggle while saying a word, "just" giggle leading up to a word. |
|
Even though ElevenLabs remains the quality leader, the others aren't that far behind.
There are even a bunch of good TTS models being released as fully open source, especially by cutting-edge Chinese labs and companies. Perhaps in a bid to cut off the legs of American AI companies or to commoditize their compliment. Whatever the case, it's great for consumers.
YCombinator-backed PlayHT has been releasing some of their good stuff too.