|
|
|
|
|
by tarkin2
1195 days ago
|
|
Most of the raw material used is available for free with no copyright on the internet, though. I have no desire to charge for my posts that are mainly for my own reference. Nor do the people who are answering on stackoverflow. But when all those are harvested to cause mass redundances there's a difference. But, at least with those two cases above, essentially reference material and technical answers, I doubt there's a copyright problem. What could regulators even do? Force the model owners to regurgitate source material references? Force the model owners to pay a small fee pemr post used? Difficult when most of the material is freely available online with ads. Besides a lot of the material will come from third parties who have scraped the posts. A whole branch of regulation could appear to try to track the source material which has been put online for free. Or even scraped and then put online again sans copyright to be rescraped without the need to pay anyone. Maybe we can ask ChatGPT to search its databases for its own sources to its own answers. And even check the source is the original. Problem solved I guess... Feed the overbrain and it'll throw you some pennies while it's used to do you out of a job. |
|
It's exactly like the current regime of copyright where I could, in principle, copy paste a file from the Linux kernel and compile it into my binary application, and nobody would know. How much would a single file from a work with tens of thousands of contributors possibly be worth, right? Wrong, it takes a single disgruntled employee (which you are guaranteed to have when you exceed a headcount of roughly 5) to destroy your business and product. The only possible way to avoid this is to train on either public/open sources or get positive authorization for each and every file you slurp for the specific use of AI training, which you definitely won't get for pennies.
As for the inevitable dominance of our AI overbrains fed on open source information, I for one, welcome them. The cat is out of the bag, it's not like we can return to the previous state of affairs. The problem, as always, becomes a political one, how to distribute the fruits of these new technical capabilities to the (human) citizens.