Hacker News new | ask | show | jobs
by ggnore7452 485 days ago
How’s this compare to likes of Fish audio? Wish they support voice clone using longer audio tho .

Haven’t looked into this space for few months , but iirc, previously SOTA was like GPT VITS or something ?

1 comments

This is the clear SOTA at the moment, even better than ElevenLabs in a technical sense, because you can specify emotion, speed, etc.