|
|
|
|
|
by alpaca128
945 days ago
|
|
> Requires a 3 second (!) clip of the voice you want to clone. Sure, if you want a guaranteed uncanny valley experience.
There is no way a few seconds are enough to cover all the ways a specific person pronounces things. A person's voice is much more than just the pitch and with a 3 second sample anyone who knows them will be able to tell something's off within 3 seconds. |
|