| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by rajman187 462 days ago
	Not a lawyer but would assume downloading material from libgen is, in the vast majority of cases, illegal because it's a breach of copyright or similar. That’s gotten Meta in quite a spectacle of late [1] [1] https://www.loeb.com/en/insights/publications/2023/12/richar...

2 comments

maxloh 462 days ago

CommonCrawl is composed of copyrighted contents too. You gain copyright on your work automatically the moment you created it, including this very comment.

link

AmazingTurtle 462 days ago

What if I repost your comment without your permission?

link

maxloh 462 days ago

One could argue that using copyrighted content in LLMs, much like reposting, should fall under fair use. This is also Microsoft's claim in the GitHub Copilot lawsuits. It's up to the court to decide though. (IANAL)

link

fulafel 462 days ago

In many jurisdictions it's just sharing that is illegal, not obtaining.

link

akx 462 days ago

Yes. The interesting legal question is that are you sharing the original knowledge if you've transformed it via teaching it to an AI.

https://www.reuters.com/legal/litigation/ai-companies-lose-b... reports on the ongoing case on the image generation side of the fence.

link

maxloh 462 days ago

That is called copyright laundering FYI.

link

anon373839 462 days ago

It’s a catchy term, but loaded. Copyright protects only original expression, not ideas and information. So if a computer algorithm reads the former and outputs the latter, arguably copyright isn’t involved at all.

There are plenty of good counterarguments to this as well, when you consider the effects of automation and scale. I’m definitely interested in seeing how the jurisprudence develops as these cases go through the courts.

link