|
|
|
|
|
by thebrid
657 days ago
|
|
As much as I love the Internet Archive, is it really that crazy? The four factors used for determining fair use are: * the purpose and character of the use
* the nature of the copyrighted work;
* the amount and substantiality of the portion used in relation to the copyrighted work as a whole
* the effect of the use upon the potential market for or value of the copyrighted work.
In the Internet Archive case, they're distributing whole, unmodified copies of copyrighted works which will of course compete with those original works.In the AI use case, they're typically aiming not to output any significant part of the training data. So they could well argue that the use is transformative, reproducing only minimal parts of the original work and not competing in the market with the original work. |
|
> In the AI use case, they're typically aiming not to output any significant part of the training data
What they’ve aimed to do and what they’ve done are two different things. Models absolutely have produced output that closely mirrors data they were trained on.
> not competing in the market with the original work
This seems like a stretch, if only because I already see how much LLMs have changed my own behavior.
These models exist because of that data, and directly compete by making it unnecessary to seek out the original information to begin with.