|
|
|
|
|
by davexunit
667 days ago
|
|
It's so obvious to me that machine learning models are derivative works of their training set. If they weren't, then why would these companies fight so hard to say otherwise? They need that training data to make their product, so they should pay the licensing fees for it! 10 years ago, when I worked on a machine learning model for my employer, it was unthinkable to train on data we did not have the rights to use. But now it's all fair game because OpenAI executives would make a little less money otherwise? They certainly aren't giving up any of their own copyright in return. It's a very transparent transfer of power and money from regular people to the bosses. |
|
Okay, but narrative creators watch movies and listen to music and read books too. Many do indeed "file the serial numbers off" other people's work and publish something else, that makes them money and not the original creators. Does one instance of "filing the serial numbers off" by one author mean that no authors anywhere are allowed to write any books as soon as they've read "a bunch" of other books? I get what you are saying, but it's not so obvious what the right policy is. It is very hard to make it consistent when "AI" is substituted with "human," and it's not so obvious if "AI" is a distinct class from human, because it is after all, something that only exists because a programmer somewhere wrote and operated it.