| > My assumption would be 'fair use'. Why? That's not obvious to me at all. These algorithms take the entire image and feed it into their maw to generate their neural network. That doesn't really sound like "fair use". If these GPT systems were only doing scholarly work, there might be an argument. However, the moment the outputs are destined somewhere other than scholarly publications that "fair use" also goes right out the window. If these algorithms took a 1% chunk of the image, like a collage would, and fed it into their algorithm, they'd have a better argument for "fair use". But, then, you don't have crowdsourced labelling that you can harvest for your training set as the cut down image probably doesn't correspond to all the prompts that the large image does. > Stable Diffusion does not create 1:1 copies of artwork it has been trained on What people aren't getting is that what the output looks like doesn't matter. This is a "color of your bits" problem--intent matters. This was covered when colorizing old black and white films:
https://chart.copyrightdata.com/Colorization.html
"The Office will register as derivative works those color versions that reveal a certain minimum amount of individual creative human authorship." (Edit: And note that they were colorizing public domain films to dodge the question of original copyright.) The current algorithms injest entire images with the intent to generate new images from them. There is no "extra thing" being injected by a human--there is a direct correspondence and the same inputs always produce the same outputs. The output is deterministically derived from the input (input images/text prompt/any internal random number generators). You don't get to claim a new copyright or fair use just because you bumped a red channel 1%. GPT is a bit more complicated than that, but not very different in spirit. |