by vitally3643 15 days ago
> An LLM trained on GPL code does not distribute that code.

You can't simply make that assertion. You'll have to prove that LLMs do not actually contain encoded copies of copyrighted code and that they are incapable of reproducing such code verbatim.

There is no evidence for such a claim, and so your entire argument is completely baseless.


> You'll have to prove that LLMs do not actually contain encoded copies

In law, an act is presumed lawful unless proven otherwise, so the burden of proof lies with whoever claims a violation occurred. I already addressed the case of sufficiently similar reproduction in my previous response.

I mean… it's been common knowledge for a while that these models do in fact contain the original data.

https://www.reddit.com/r/programming/comments/oc9qj1/copilot...

You can disagree all you want, but there's ample evidence of this.