Hacker News new | ask | show | jobs
by carlbarrdahl 1231 days ago
It's making an api request to play.ht:

https://github.com/yacineMTB/scribepod/blob/master/playht.ts...

2 comments

I wonder why the title says that it uses Tortoise TTS?

Also interesting that play.ht allows you to clone others' voices.

How did they get to use the joe rogan voice though? It seems that one isn’t public?
It uses the TorToiSe TTS model for generation. It's simple to generate conditioning voice latents using short audio samples. Likely transcribed JRE episodes were part of the TorToiSe training data, explaining how it's so good at recreating his voice characteristics in particular.