Hacker News new | ask | show | jobs
by visarga 704 days ago
LLMs don't steal/pirate works, "copying" is the word you're looking for. So much faster, cheaper and precise to copy than to use LLMs. What LLMs do is to combine user information with patterns learned from data. Almost never a full work, they rarely generate more than 1000 words at once. Books are 100x larger
3 comments

Actually, "launder" is the word I'd use for LLMs.

Clean things up just enough that it's difficult to prove where their semantic content came from.

How useful this is or isn't, and how much of a threat to rightful IP owners, depends very much on the type of IP.

If "laundering" IP is made illegal, then we're all in for a huge surprise. Almost everything we say and do has been said and done before. And we're rarely the originators of our main ideas, we "launder" 99.99% of what we know, even subconsciously. Any one human could be suspected of secretly using AI today.
Totally agree, the difference is speed and scale.

Copyright laws didn't need to be invented until the printing press came along, because the act of copying was slow and difficult.

Not a fan of the patent model for software, for example, but perhaps this is an argument for it. Or, we just get used to the fact that idea-reuse is cost-free, accept a couple of decades of uncomfortable economic dislocation, and get on with it.

I don't think "copying" applies to what an LLM does either!
"Good artists copy. Great artists steal." - Picasso