Hacker News
by rpdillon 654 days ago
> stealing copyrighted data.

As ever, I'll be a pedant and point out that "stealing copyrighted data" is not a thing.

More substantively, we don't know whether training is copyright infringement or not. The courts have yet to weigh in in any jurisdiction I'm aware of (i.e. the EU or the US).

2 comments

I didn't make any statements about rulings. I just commented about what seems to be the general sentiment this time around.

As others pointed out, if LLM startups go broke because they can no longer steal data, then that will just be the reality of it.

Hmm... it's not "steal any more data". Stealing is taking property belonging to another with the intention of permanently depriving them of it.
> As ever, I'll be a pedant and point out that "stealing copyrighted data" is not a thing.

I’m sure that if you somehow got access to ChatGPT weights and started selling them, OpenAI would be happy to call it stealing.

I'm really only concerned with what the courts call it.