Hacker News new | ask | show | jobs
by dazhengca 2405 days ago
How could there be enough samples of his voice? Seems like quite a jump to “it was a deep fake” unless there’s more info they haven’t revealed
2 comments

From yesterday: https://news.ycombinator.com/item?id=21525878

Only requires 5 seconds of voice audio to synthesize believable speech.

The cadence of the synthesized voices are still noticeably artificial, even for the short demo phrases. This is not to say this isn't impressive. But how much does this method improve when it isn't constrained by a 5-second sample? If we feed it several hours of public speeches from Martin Luther King Jr, or hell, tens of hours of audio from President Obama or Trump – will it have the same artificial cadence, even if the tone and pitch of the imitated voice is accurate?