|
|
|
|
|
by Arnt
898 days ago
|
|
We use copyrighted materials all the time without permission. You and I both read the Verge article without the Verge's permission. Reading is the intended and most common use of the Verge's articles, and neither of us asked for permission. I didn't print that one but I often do print, always without asking anyone's permission. Copyright has that name because copying is exceptionally protected; general use is not. You can argue that training is a kind of copying, since it involves copying of things from RAM to RAM, etc. I find that difficult, since we've established that e.g. this browser's copying of web page contents from RAM to RAM isn't. If you don't argue that training is copying, then you can argue that since training is a necessary prelude to copying, it should be treated like copying legally. I disagree, because various kinds of fair use also has the training as a necessary prelude (and, uh, the purchase I mentioned could also be a necessary prelude to copying, if my goal was to copy the album). |
|
You don't need permission as a human, to read content if it's freely available. It was explicitly made for us to consume with an expectation of returning value. Mainly advertising.
Laws are (in theory) put in place to ensure a fair playing ground. If Company B requires content from Company A, but is causing financial damage to company A by using it, this is not fair use.
I'd also like to add the intentions. Even if we decide to quote the paper later in the day, I would say it's fair to assume 95% of readers do not intend to copy the content for profit.
OpenAI on the other hand is explicitly intending to copy the material to reuse into its own content in millions of generations for profit.
No metaphors needed.