Hacker News
by underlipton 26 days ago
The way LLMs work makes it impossible to trace a derivative work back to its source data in the training set. However, the weights couldn't exist without that training data - the creators' works were used during training - and the entities making money off that training data are primarily the LLM platform owners. So they should pay.

We are trying to avoid another situation where "resource wealth" goes uncompensated and producers remain poor while processors, marketers, and merchants reap all the benefit. Unless your aim is something else, in which case you should state it.