|
|
|
|
|
by bluefirebrand
1 day ago
|
|
> It's funny how worked up people get about copyright with respect to to AI training when using copyrighted material for training an AI model is fair use Personally I disagree with the finding that it is fair use. I think the fact that it was found to be fair use is a miscarriage of justice. Am I allowed to have dissenting opinions on that topic? |
|
The essence of fair use is that it is ok if the use of the copyrighted material is transformative. LLM training I believe is transformative, as it is taking the data (as in a work of text, an image, video), and feeding it through the layers of a neural net to marginally update its weights. It is factoring each work into a very very small fraction of the model's overall sense of the world.
Now, AI models are capable of reproducing copyrighted works to a degree of high accuracy, especially if those works recur very frequently in the training data, but I believe that's a different issue. Any video camera is capable of taking a photo of a copyrighted work, but that isn't essential to the value of the camera, though it is undeniably what certain cameras are used for. The exact reproduction of copyrighted works is a likewise something that LLM's can do, as any intelligent person could recite song lyrics or a work if they memorized it, but each individual work is only marginal to the overall effectiveness and value proposition of the model.