Hacker News new | ask | show | jobs
by parsimo2010 465 days ago
I think in later cases we'll get some tests to apply about how much human intervention is required.

Who trained the LLM is probably not the issue, the courts would likely want to know about the training material. If I trained a model exclusively on Warhol art, and then had that model create new images in Warhol's style, I didn't do any of the creative work and probably don't get the copyright. Warhol's estate probably owns the copyright to the model generated images as they are derivative works.

I do think that a model trained on many different artists' works, with me providing substantial feedback to the model (and I can show the process), probably will at some point give me the copyright.

Somewhere there is a line:

- "Make a picture of a mouse." Probably not giving you copyright

- Using a model to erase a powerline in a photograph you took. Probably you own the copyright to the original image and the one without a powerline in it (regardless of how many other people's images the model was trained on).

- "Make a picture of a mouse, who is bipedal, wearing pink shorts, with a chip in his ear, wearing sunglasses, with scruffy whiskers, holding a surfboard, on his way to the beach to hit some waves." then updating with "make him shorter, give him blue sneakers" and then updating with numerous other tweaks until you get it just the way you want. Who knows where this lands?

I think that in the short term the courts are going to land on the side of "anything made by a model trained on existing artwork is derivative of the training set so you can't own the copyright, no matter how much you tweak it." I think eventually the courts will recognize there is some amount of input that makes the computer image the realization of a vision in your head, and not a derivative of the training set. Just how every individual musical note has been played before, but at some point, you put them together in an arrangement that is original.

1 comments

> If I trained a model exclusively on Warhol art, and then had that model create new images in Warhol's style, I didn't do any of the creative work and probably don't get the copyright.

If I watch exclusively Warhol images for years and then paint something similar I get copyright.

There needs to be a gray are, because usually art is not done in a vacuum?

> If I watch exclusively Warhol images for years and then paint something similar I get copyright.

Not necessarily. If you copy one of Warhol's works but "change it a little" then that is a derivative work, and the copyright belongs to Warhol's estate. Depending on how close of a copy it is, you would have a tough time defending your claim to copyright in court. The advantage an offending artist has in court is that they can claim "inspiration" as long as they don't admit to copying.

For a computer model the difference maker is that the court can probably obtain records of a training set, so if the training set is exclusively Warhol works it is probably easy to get a court to side on "derivative" and assume the computer does not possess inspiration.

Courts have basically baked in "gray areas" in copyright cases. The historical copyright tests are all written as to sound like mathematical formulas but everything is kind of subjective.

You also cannot train only on Warhol imagery, unless he drew billions of pics. So this is hypothetical “if”. In reality you finetune an existing network based on a dataset much larger than Warhol’s.