Hacker News new | ask | show | jobs
by p_l 57 days ago
That is in fact correct.

Both the compiler (in absence of inclusion of copyrighted libraries) and the LLM are considered to not add creative work and thus do not change copyright status of the works they transform.

You can consider the training set of the LLM or other AI model to be 3rd party libraries and the level of copyright from them applying to final output to be how much can be directly considered derivative, just as reading copyrighted code and being inspired by it does not pass that copyright to your work unless it's obviously derivative

1 comments

>> You can consider the training set of the LLM or other AI model to be 3rd party libraries ...

I like this comparison -- training set as '3rd party libraries'. Except, of course, that the authors behind the training set may not have actually granted permission to use, whereas the 3rd party libraries usually have some permission by way of license.

The law only cares about how the work is distributed - if you acquired it legally by purchasing, yes you can train LLM on it, and with exception of moral rights in places like EU the author does not have more to say on it.

It's treated the same as human reading and learning from the work.

You have only the granted artificial monopoly on acts of distribution under US law