Hacker News new | ask | show | jobs
by monological 1046 days ago
Podshorty does something kind of similar, but it takes any YouTube link, summarizes it and generates a podcast using the voices of the original speakers. Also creates transcripts so you can follow along. https://www.podshorty.com
2 comments

This is copyright infringement, not ok to be using and monetising off of someone else’s voice. Over the next months there are plans for way tighter controls on this so I’d expect that the “using the voices of the original speakers” feature will not be available, unless a monetisation method is developed.
Actually, according to US law, it doesn't appear to be copyright infringement. Do you have anything to back up those claims?

According to Butler v. Target Corp., it was held that although lyrics to a song are copyrightable, the underlying voice is not. As such, there is no copyright protection available to the infinite number of words or phrases a person might utter in their distinctive voice.

Additionally, the synthesized audio can be considered derivative, as it transforms the the "audio" into something entirely different than original, and so falls under 17 U.S.C.A § 103.

So, I'm not sure what you mean when you say there are plans for tighter controls. Care to back that up?

Disclaimer: I am not a lawyer and this is my personal opinion.

Sorry what is exactly your claim here? That it is legal to use and monetise off of someone else's voice? Using Morgan Freeman's or Joe Rogan's voice for example to host your podcasts?
I'm just pointing out that there is a lot of legal gray area when it comes to AI-synthesized voices and copyright. It still remains to be seen how the courts end up ruling.

What's the difference between using an AI voice and Bill Hader or Jimmy Fallon doing a celebrity impersonation on his show and monetizing that?

Yes. As long as you make no claims as to who's voice it is, that is.
Their website seems to be plastered in claims that it "uses the original speaker's voice"
Voices are not protected by copyright.
The barriers to entry are so low (for now), and everyone is scrambling to build a moat.

The AI image generation space has hundreds of players. Audio has dozens.

One likely outcome is that big tech will come to each of the "successful" companies with close peer competitors and offer to buy them. If they say no, they buy their competitor. Or build it internally.

You'll have to run really fast and hard to survive. I think it's totally doable, though, and this is a very interesting attack gradient.

Best of luck! It's exciting times.

Thank you!