Hacker News new | ask | show | jobs
by krapp 932 days ago
Either buy rights to the data, produce training data for which you own the rights or use copyright-free data. Those options exist, but no one takes advantage of them because none of them are as much of a "free money machine" as just ripping off as many people as possible to homogenize and commodify their work.

If LLM development can't continue without violating copyright then that makes it clear that the purpose of LLM development is violation of copyright. Which is something we all already knew but it's nice to have it spelled out in no uncertain terms.

2 comments

> If LLM development can't continue without violating copyright then that makes it clear that the purpose of LLM development is violation of copyright.

This is a very extreme view. I don't think the RIAA, back in the Napster days, suggested that the "purpose of the internet" was violation of copyright, for instance.

No one ever said development of the internet couldn't continue if copyright had to be respected, either, so the proof is in the pudding.
What do you think of the explination that the purpose of copyright is to prevent LLM development?