|
|
|
|
|
by NicuCalcea
935 days ago
|
|
I think it may change the discussion about copyright a bit. I've seen many arguments that while GPTs are trained on copyrighted material, they don't parrot it back verbatim and their output is highly transformative. This shows pretty clearly that the models do retain and return large chunks of texts exactly how they read them. |
|
One model is trained on copyrighted works in a jurisdiction where this is allowed and outputs "transformative" summaries of book chapters. This serves as training data for the deployed model.