by spywaregorilla 1250 days ago
I feel like it would be much harder to create a set of hard controls, like MIDI, to shape the voice acting than to build a co-embedding space of voices and descriptions of those voices and just say "Say this quietly and meanly". Thoughts?

Exactly! The only issue is having a well-labelled dataset with those types of cues. We have an idea for how to do it, though!