by triceratops
411 days ago
> by being shown an excerpt [of copyrighted material]

How is this done? Are bits not written into RAM or disk? Are they not sent between machines in a training cluster? That's copying.

> it is seemingly not far removed from how humans consume content

Except that humans don't make full copies to RAM, or disk or paper.
AI doesn't need lasting copies to train, though I don't know what the actual implementation is. If it's ruled that they can only use copyrighted data as long as it's not stored for longer than a human would take to consume it, that wouldn't really cripple the models, but it might make training more logistically challenging.
It's important to understand that models are not data archives. They are statistical constructs built by being quizzed, with human-made content used to generate the quiz questions.
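
To make the "quiz" idea concrete, here is a minimal sketch, assuming the quiz is next-character prediction and that excerpts arrive one at a time from a stream. Everything in it (`TinyNextCharModel`, `quiz_pairs`, `stream_excerpts`, the learning rate) is hypothetical and made up for illustration. It is a toy, not a description of how any real training pipeline handles data, and it says nothing about whether a model can end up memorizing what it was quizzed on.

```python
import math


def quiz_pairs(excerpt):
    """Turn an excerpt into quiz questions: (current char, next char to guess)."""
    return list(zip(excerpt, excerpt[1:]))


class TinyNextCharModel:
    """Hypothetical toy model: one logit per (context char, next char) pair."""

    def __init__(self, vocab, lr=0.5):
        self.vocab = sorted(vocab)
        self.lr = lr
        self.logits = {c: {n: 0.0 for n in self.vocab} for c in self.vocab}

    def predict(self, ctx):
        """Softmax over the logits for one context character."""
        row = self.logits[ctx]
        top = max(row.values())
        exps = {n: math.exp(v - top) for n, v in row.items()}
        total = sum(exps.values())
        return {n: e / total for n, e in exps.items()}

    def train_on(self, excerpt):
        """Quiz the model on one excerpt, nudge the weights, return mean loss."""
        pairs = quiz_pairs(excerpt)
        loss = 0.0
        for ctx, answer in pairs:
            probs = self.predict(ctx)
            loss -= math.log(probs[answer])
            # Cross-entropy gradient step: raise the right answer, lower the rest.
            for n in self.vocab:
                grad = probs[n] - (1.0 if n == answer else 0.0)
                self.logits[ctx][n] -= self.lr * grad
        return loss / len(pairs)


def stream_excerpts():
    """Stand-in for a data loader that yields excerpts one at a time."""
    for _ in range(50):
        yield "abcabcabc"


model = TinyNextCharModel(vocab="abc")
# Each excerpt is used for one round of updates and then dropped;
# only the logits (the "statistics") persist between iterations.
losses = [model.train_on(excerpt) for excerpt in stream_excerpts()]
```

After the loop, the only state left is `model.logits` and the list of loss values. The excerpt still had to sit in memory while it was being used, which is the point the parent comment raises.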