Y
Hacker News
new
|
ask
|
show
|
jobs
by
gkucsko
1154 days ago
history prompts are just unconditionally generated TTS from the same model. any of those can be used as history, but for convenience 10 are provided for each language (to generate things with consistent voices)
1 comments
turnsout
1154 days ago
So the history prompts are collections of text/audio pairs?
link
gkucsko
1154 days ago
history is semantic, coarse and fine. so essentially the same thing thats getting generated just using it as an input before the generation
link
CamperBob2
1152 days ago
So how do you clone an existing speaker's voice? That's the part I don't get.
link