Hacker News new | ask | show | jobs
by khelavastr 8 days ago
This is shady. Copyrighters absolutely not get to control use of their copyrighted material when people mentally, sonically, or physically reproduce it for personal use.

It's absurd to say "you can't record this book to a friend or robot".

Nobody seems to actually reproduce the copyrighted materials.

High-dimensional eigendecompositions which underpin AI similarity are some of the most literally derivative materials of texts that you can imagine.

1 comments

So you record a copy "for a friend" and then you sell lots of those copies as your business. All within your rights! What's mine is yours, my Comrade!

(my point being that it would be different if the product CommonCrawl provides were trained models, but this is not the case: its product is unlawful reproductions of copyrighted data for commercial use)

> then you sell lots of those copies as your business

Common Crawl is not a business and is not selling anything.

It's awfully hard to claim you aren't selling anything when you're giving other people's stuff away for free and they would be selling it if you hadn't, um, "saved them the trouble"