Hacker News new | ask | show | jobs
by bick_nyers 1785 days ago
The implications for the entertainment industry are massive.

When I was working in indie game development, I wondered if you could use deepfakes as a voice actor. Basically get someone famous/good voice with infinite voice lines, without having to pay for studio time. Obviously, you would need them to sign-off on using their voice for commercial purposes.

3 comments

There's a post on Hacker News for this, by a company called Sonantic.
How would you get the acting part of the voice acting right? I can’t imagine you wouldn’t still need a skilled voice actor for that.
Yeah, the acting part is a valid concern. A mod for The Witcher 3 does this to give the main character voiced dialogue[1], but it doesn't really sound.... right. I mean, it is voiced and some lines feel authentic, but some lines also just feel odd.

[1]: https://www.gamesradar.com/witcher-3-mod-uses-ai-to-create-n...

Not if you're going to voice the Elcor from Mass Effect.

"With barely contained terror. You drive a hard bargain."

A markup language for voice that tells the generator how to inflect everything. It's not on the horizon yet, but anything that our voice can do, a computer will do someday.
Neat! Looks like I'm behind the times already.
Then the person doing the markup becomes the talent you have to pay to make things good.
It's a lot easier to write "cries like a baby" or "screams in terror" than it is to actually do it on command, over and over again, for take after take.

And one can even imagine a program with emotional slider bars that lets a person listen to how a line sounds with different levels of inflection and then automatically inserts the appropriate markup for the settings the user selects.

That percon can be replacable. Or it can be team. And you don't need to worry about the AI tiring or damaging their vocal cords after trying out different intonations all day. And eventually the there will be good enough automation to generate the intonattions too - either entirely or with minimal input from a voice director.
Yup exactly. Anything you would tell a voice actor to do you have in the markup. Obviously, the voice actor can still produce higher quality, probably for a long time to come.
I believe that's already out there, since services like Alexa will do certain inflections depending on the context of what they're saying. I think.
I'm working on https://vo.codes

The new version is almost ready to launch.

I've also got voice to voice conversion working, and I'm trying to make it real time. It's pretty close.