by QuercusMax
229 days ago
Aren't we talking about the auditory quality of the generated vocals? I don't understand how you could think the textual training data could possibly affect the perceived vocal strain (which is really just artifacts) in the generated vocals.