Hacker News new | ask | show | jobs
by erichocean 1177 days ago
I'm using this kind of technology for temporary voice tracks in animated shorts.

I'd really like something like Img2Img for voices so I can translate a performance to an arbitrary (synthetic) voice.

1 comments

Tortoise TTS can do this. You just pass it your example as a conditioning latent.