|
|
|
|
|
by stale2002
676 days ago
|
|
> Try to do something significant with the pirated content, as the AI startups do, and you'll immediately get trouble. No they won't. 80% of the people I meet at tech events either have an AI startup or are working at one. Basically all of them are doing legally questionable stuff, when it comes to copyright law. Anyone who has ever trained a model, or finetuned a model, is likely using other people's data. This is universal. It is almost everyone. And yet, despite this being a gray area, basically nobody is getting into trouble for this. We are all getting away with it. And there is only a singular lawsuit about this against like 4 companies and nobody else. (midjourney, runwayML, Stable diffusion, and deviant art) So the point stands. Almost nobody is getting into trouble, despite this behavior being widespead everywhere, and if you don't do the same thing then you are going to fall behind and fail. |
|
Citation needed.
The whole reason there's no legal trouble is that it cannot be proved conclusively that they used copyrighted data for training.
"when the CTO of OpenAI is asked if Sora was trained on YouTube videos, she says “actually I’m not sure” and refuses to discuss all further questions about the training data". Why do you think she's denying the obvious?