|
|
|
|
|
by naet
1124 days ago
|
|
I think this is a fallacy that I see a lot in recent AI discussions. An LLM is not the same as a human brain. You might see some superficial similarities in both being able to produce a block of text, but the method by which the text is produced is entirely different. For example, we can't download entire libraries of books instantly to our brains and then reproduce those books word for word in memory. Things that operate at different scales and by different methods should have different regulations, in the same way a bike or a car is regulated differently from a truck or another piece of heavy machinery. Also, humans can be, and often are, found liable for copyright infringement or for piracy depending on how they conduct themselves. If a human was to reproduce a copyrighted book word for word, that would consist of copyright infringement regardless of whether it was done by rote memory, by copy and paste, or assisted by a black box LLM. Even if a human paraphrases another work they can still be found guilty of plagiarism if the paraphrase is still overly similar to the original source material. A human can also be guilty of copyright infringement if they use a copyright work as source material in certain ways. If I steal a stock image without paying for a license and add it in my Photoshop collage, I might be found to have pirated or infringed on the original image creator's property. LLMs are trained on copyright data and can often reproduce that copyright data. It's an open question how we regulate this. I personally think it would be fair for an artist or author to say their work was not licensed to be used in training a neural net or otherwise request to opt out. |
|
Yeah. Nobody's talking about word-for-word duplication here.
> If a human was to reproduce a copyrighted book word for word...
Again?
> Even if a human paraphrases another work they can still be found guilty of plagiarism if the paraphrase is still overly similar to the original source material.
Go look up the dictionary definition of plagiarism. Notice the most crucial element, which you seem to have omitted here, and also notice that it's irrelevant to AI systems, which overtly acknowledge that they exist to generate derivative works.
> If I steal a stock image without paying for a license
Here's another version of your "word-for-word" analogy, which nobody else is talking about.
> I personally think it would be fair for an artist or author to say their work was not licensed to be used in training a neural net or otherwise request to opt out.
I am genuinely curious: how do you propose to enforce this?