Even that notion is tricky in the context of text AI. It doesn't seem obviously clear to me what the implied stance towards AI training would be for many popular license types. It seems more like the creators of many licenses didn't explicitly consider this use case.
And the more meta question is whether the creators' rights should even extend to that realm. Can you specify in a license "This text must not be read by students in the context of an educational course" (which is arguably the closest analogue to AI training)? If that's not possible, then where do we draw the line between tools-assisted learning by humans and human tools learning on their own? You can arrive at answers, but it seems to me like there are many possible reasonable interpretations.
Why is it fair use if you train a model on copyrighted material and use its transformative output, despite going against the will of the author?
Can we all legally pirate educational books since it's for self training and producing transformative outputs? Can I consume all media (books, movies, music) the same way, and call it fair use? Also for software?
Content that hasn't been legally obtained wouldn't be legal to consume in any way. That's not what I meant, and I think that's a bit of a disingenuous interpretation of my comment.
The debate is about content that is generally legally obtained, but might come with certain restrictions. Where restrictions is a broad term and might also just come in the form of a copyleft license, eg. My main point was that in many situations, eg involving open source licenses, it's really not clear from the terms what the creator's intent regarding AI training was. And the broader question is whether training is fair use, or something else, maybe even a new legal concept that would have to be established. Or, what's the difference between art students going to the museum to be inspired, and Dall-E 'looking' at public domain images?
I acknowledge I stretched a bit your comment. I meant it as a way to try to find the line on what's OK with these new AIs rather than ill-intended O:)
Regarding your point, I wonder how different is it to break the "terms of use" for fair right, vs breaking the "terms of obtaining" alltogether. They both are about jumping over owner's will, who has the full rights of the work.
I very much look forward to see how the ethics and law around these issues evolve.
And the more meta question is whether the creators' rights should even extend to that realm. Can you specify in a license "This text must not be read by students in the context of an educational course" (which is arguably the closest analogue to AI training)? If that's not possible, then where do we draw the line between tools-assisted learning by humans and human tools learning on their own? You can arrive at answers, but it seems to me like there are many possible reasonable interpretations.