| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by beastman82 93 days ago
	The assumption here is that the people who maintain something in a painstaking manner did not intend people to take it and do whatever they want with it in accordance with its license?

1 comments

sumeno 93 days ago

"in accordance with its license" is the key part that's missing with LLMs. The licenses are completely ignored.

link

nickff 93 days ago

It seems to be a common view on HN that licenses and conditional access to websites should be ignored (i.e. WRT ad-blockers), but also that licenses on Open-Source Software repositories should be respected (i.e. WRT LLM training). I believe that holding these contradictory views is common, but the conflict would need to be resolved to come to a conclusion on how to proceed with LLM training.

link

pseudalopex 93 days ago

There is no contradiction. Open source software licenses allow use without conditions. Ad blocker use does not distribute the modified web pages.

link

nickff 93 days ago

I have not seen any evidence that LLMs ‘distribute’ modified software, though they do seem capable of replicating it.

link

lavela 93 days ago

I fail to see how mass scale reproduction of copyrighted code isn't a form of distribution.

link

nickff 93 days ago

Replication is not the same as reproduction; I can replicate an API without violating someone's license or copyright (which I would by reproducing their work).

link

pseudalopex 93 days ago

The view LLMs should respect open source software licenses is not for replication alone. Models and generated code are derived from training data.

link

nickff 93 days ago

Developers are permitted to learn from open source code with restrictive copyrights, and apply those lessons to developing other software which does not comply with the copyright of their 'example'.

As an aside, I do believe that LLM trainers are ignoring and violating many licenses, but open-source software is not a clear example of a violation.

link

no-name-here 93 days ago

> Open source software licenses allow use without conditions.

Don't a number of open source licenses notably involve restrictions?

link

technothrasher 93 days ago

You seem to be conflating copyright with access rights. Two very different things. Regardless of your feelings on either, there is no contradiction in holding different views on them.

Well no, it’s about legally gating the ability to copy so the original author doesn’t have to compete in the same market to sell his own book with every other bloke with a printing press and a copy of the book. Everything else is an addendum.

link

joquarky 93 days ago

No, it's to promote the progress of science and the useful arts.

The current implementation has recently become obsolete.

link

nickff 93 days ago

That may be the reason copyright came to be, but it's much more expansive now.

link

dartharva 93 days ago

> The licenses are completely ignored.

Where and when? In cases where LLM coding assistants reproduce copyleft code in someone's work assignment? The responsibility in those would be on the user, not on AI.

link

patagurbon 93 days ago

In reproducing code that requires the license be reproduced alongside it.

link

sumeno 93 days ago

Are you doing a full search of every GPL licensed repository every time you use an LLM to ensure that it isn't giving you GPL licensed code? That doesn't seem reasonable

link

bayindirh 93 days ago

This is what GitHub promised years ago. Showing repositories where similar code is present so you can guess the license and use appropriate outputs.

I’m not sure whether this is implemented or not since I don’t use generative AI for coding.

link

dartharva 93 days ago

Why not? Up until a year or two ago LLM pair programmers weren't even a thing.

link

pseudalopex 93 days ago

The user would know how?

link

joquarky 93 days ago

That's because licenses are an abstract complexity tacked on to a simple material reality in order "to promote the progress of science and the useful arts".

Just like many cultural rules, they keep growing in complexity until they reach a phase change where they become ignored because they have become too complicated.

link

wk_end 93 days ago

OSS licenses haven't grown in complexity all that much in the past forty or so years. They're being ignored more now because it's become easier to ignore them, not because it's become harder to abide by them.

link