Hacker News new | ask | show | jobs
by olaulaja 1050 days ago
The thing is, most styles don't have names, only artists using them. If you remove artist names from the training data, you lose the ability to describe those styles, including as combinations (so you can't even do "influenced by rutkowski" styles). This obviously sucks for the user, and I don't think artists should be able to deny control over style just because it makes imitation easy.

In this context, putting an artist back in the model is a pretty sensible form of disagreement/protest. I admit the "I'll take this down if they ask me to" is strange, but mostly because its too weak; the LoRA should just stay up, not demand another no from someone who already made their opinion clear.

2 comments

> most styles don't have names

If you want to educate yourself on how to refer to different art styles you can start here: https://en.wikipedia.org/wiki/List_of_art_movements

> If you remove artist names from the training data, you lose the ability to describe those styles

It's almost as if artists develop their own styles that are just as uniquely theirs as any one work produced by them in that style, if not more so! When people put "by Greg Rutkowski" in their AI art prompts, they are not asking for results in a style that exists independently of Greg Rutkowski despite his name somehow being the only way to identify it. They are, in fact, asking for results in the style of Greg Rutkowski in the sense that works by Greg Rutkowski were labeled as such and fed into the training process, programmatically creating a digest of works by (at least) Greg Rutkowski that we call a "model", and that model is as a result capable of mimicking the style of Greg Rutkowski specifically. They are asking for results that look like they were painted by Greg Rutkowski, and they care enough to ask for Greg Rutkowski specifically because Greg Rutkowski is a highly skilled artist who developed his own distinctive style which is the central pillar of his intellectual property as an artist, and his competitive advantage.

Intellectual property law as it stands today obviously couldn't have anticipated this technology, and didn't. However, if this kind of thing is still legal in ten years, that will represent a major systemic failure of copyright and trade secrets law to protect the core intellectual property of artists in the same spirit as it currently protects intellectual property in other domains.

That list is only really useful for major styles that are well known/tagged, early SD versions knew a list of artists _much_ longer and more relevant than that. (some of the styles on that list are not related to 2d images). If you consider the list a reasonable substitute for artist names, I'd ask you for a prompt that replicates the common Rutkowski + Mucha style using just the styles listed.

>> It's almost as if artists develop their own styles that are just as uniquely theirs

Hard disagree on this. Styles do not and should not have ownership, legally or otherwise. Not in painting, music, film, games, or any other kind of art. Anything else kneecaps derivative styles and erects a massive barrier to the advancement of culture. Straight copies are mostly avoided because they add nothing over the original.

You're also misunderstanding the typical use of Rutkowski and other names. Most prompts I've seen and used use multiple artists for a unique blend of styles (including the much maligned and overused Rutkowski + Mucha), not just straight copies. While asking for Rutkowski indeed asks for a very specific style (as well as any themes etc common to his work), its usually in combination with other equally specific and somewhat contradictory styles, each guiding the result in some direction and adding elements but none of them being exactly the desired end result. Just asking for straight up Rutkowski was a short lived cargo cult and is now more of a meme than anything else.

> If you consider the list a reasonable substitute for artist names, I’d ask you for a prompt that replicates the common Rutkowski + Mucha style using just the styles listed.

Not looking at the list, but replacing “by Greg Rutkowski” with “detailed realistic muted fantasy” and “by Alphonse Mucha” with “Art Nouveau” seems to work pretty well for reproducing both “by Greg Rutkowski” alone and for the Rutkowski + Mucha combination. (Oddly, replacing either of those, but not the other, does not work as well for reproducing the combination effect as replacing both.)

> Just asking for straight up Rutkowski was a short lived cargo cult and is now more of a meme than anything else.

Rutkowski + Mucha is as much of a cargo cult as Rutkowski alone, even if it is a longer-lived one.

> If you consider the list a reasonable substitute for artist names

I didn't say or suggest any such thing.

> I'd ask you for a prompt that replicates the common Rutkowski + Mucha style using just the styles listed.

You're completely missing the point, which is that if you want the model to (e.g.) combine the individual styles of two specific artists in its output then you are not just looking for a style, you are looking for mimicries of those specific artists combined and normalized by the model. The model is able to do this because works by these artists were annotated as such and fed verbatim into the training process. Of course it can't mimic a specific artist unless you mention them by name!

> Hard disagree on this. Styles do not and should not have ownership, legally or otherwise. Not in painting, music, film, games, or any other kind of art. Anything else kneecaps derivative styles and erects a massive barrier to the advancement of culture. Straight copies are mostly avoided because they add nothing over the original.

You (the general you, an amalgam of the AI art defenders I've seen) claim the right to create these unauthorized derivative works via an automated process with zero reliance on novel human authorship beyond a short text string, then work backwards from there to rationalize to yourself why what you're doing is okay. Then, when artists who've invested decades of their lives into the development of their technique and process and relationship with art tell you that your rationalization is a facile one and that you are stealing from them, you are so sure you know better than them that you sweep aside their concerns and accuse them of jealousy.

In context, it's an absurd and frankly disgusting level of self-absorption. It's like the way crackpots relate to real working scientists, angrily casting the lack of attention paid to them and their "theories" as jealous conspiracy and gatekeeping rather than considering that maybe they simply have no idea what they're talking about, because why would they? But in this case the big money is unfortunately on the side of the crackpots.

I don't care about the "meta", and I don't care about "memes" or any other specifics of the subculture of AI art tool users. What I care about is that these tools obviously launder the intellectual property of artists into ostensibly non-infringing derivative works. I think the comparison with how human artists develop and apply technique and style is intellectually fascinating and worthy of discussion^[1], but from a legal and moral perspective comparing the two is like comparing a detective listening to a wiretap and taking notes by hand with the NSA's PRISM program or similar. They are superficially the same from a certain narrow perspective (and I believe there has been no end of facile rationalizations deployed to this effect), but broadly they are substantively different, morally and practically and especially in their implications for society. Everybody knows that justifying the latter on the basis of the former is foot-in-the-door bullshit that flies in the face of the spirit of the law.

Where will the law go? Who knows! All I know for sure is which side has the money.

^[1] I think if you go back in my commenting history you'll see me talking about that at length, and even taking what may appear to be the other side of this debate.

> You (the general you, an amalgam of the AI art defenders I’ve seen) claim the right to create these unauthorized derivative works via an automated process with zero reliance on novel human authorship

If they are created “via an automated process with zero reliance on novel human authorship” they, ipso facto, are not “derivative works”, which are original creations of human authorship subject to separate copyright (which, nonetheless, require permission from the copyright holder of the original work to legally create unless an exception to copyright protection, like fair use, applies.)

What you probably want to argue is that they are unauthorized mechanical copies, not unauthorized derivative works.

Of course, if they are unauthorized mechanical copies, it is probably usually a partial copy of the training set, which if it has a copyright it is a compilation copyright owned by whoever assembled the training set, not of any particular work within the training set. Now, whether the training set is an unauthorized derivative work when compiled is another question.

> What you probably want to argue

I don't really want to argue either because I'm not a lawyer and from a lay perspective AI works don't seem to fit neatly into either category. It's obvious to me that infringement occurs in spirit, which is effectively what i'm arguing; beyond that I really couldn't speculate on the details of how a successful case to that effect would be built, or how new law would be written.

> They are asking for results that look like they were painted by Greg Rutkowski, and they care enough to ask for Greg Rutkowski specifically because Greg Rutkowski is a highly skilled artist who developed his own distinctive style

Sometimes, but not always.

Lots of people prompt by cargo cult -- you find lots and lots of images generated by peppering the list with stuff like "masterpiece, best quality, 8k, award-winning".

Are all of those found in the training set? Do they actually have the right result?

It's statistics, not magic. The machine doesn't know what you mean, it only knows what's in the dataset. Some tags are not well represented enough to have the result you'd expect from the name.

Googling around suggests "masterpiece" mostly translates to higher contrast and saturation.

> they are not asking for results in a style that exists independently of Greg Rutkowski despite his name somehow being the only way to identify it.

Some might be. There's not always a tag for every kind of aesthetic, so one possible solution is to find an artist that uses it.

> which is the central pillar of his intellectual property as an artist, and his competitive advantage.

And which copyright law says he doesn't have. Style isn't copyrightable.

> Sometimes, but not always.

Sure—they often have such a poor understanding of art and the artist that they may not know that's what they're looking for, but that doesn't change the fact that it is, indeed, what they are looking for.

> And which copyright law says he doesn't have. Style isn't copyrightable.

The part of my comment you quoted says nothing about copyright law, so I'm not sure what you're trying to contradict here. An artist's personal style is a central pillar of their intellectual property in the sense that it takes massive personal investment to develop and everything they create flows from it, and it is a central pillar of their competitive advantage in the sense that it's aesthetically unique, positively stimulating to the viewer, and links their identity to their entire body of work in a way that can be exploited in the marketplace.

The fact is that copyright law currently has no idea what to make of these AI tools. I think the only concrete opinion so far adopted by the US copyright office (which may in the future be superseded by legal precedent or law) is that products of AI alone are not products of human authorship and thus cannot themselves be copyrighted. It will take time for arguments to be made, for money to change hands, and finally for more broadly useful law to be settled, which is why I previously alluded to a period of ten years to let things shake out.

If copyright law does step up to protect artists from this novel attack on their intellectual property, I doubt that protection will come in the form of allowing styles to be broadly copyrightable. From the contemporary commercial perspective, most of the value proposition of these tools is the ability to automatically and at scale launder the intellectual property of artists into derivative works that ostensibly do not infringe on the original works; given that, it is very much in the spirit of existing intellectual property law to write new law to protect artists.

Only time will tell whether the inevitable evolution of intellectual property law to account for AI tools favors individual artists or large tech corporations.

> Only time will tell whether the inevitable evolution of intellectual property law to account for AI tools favors individual artists or large tech corporations.

Definitely tech corporations. I'd be careful placing my hopes on copyright.

You know there's a lot of public domain out there, as well as permissively licensed images, right?

So here's my future prediction:

1. If forced, AI companies will prune their datasets of anything not permissively licensed.

2. They'll start friendly community initiatives to fill in the holes. Just like Wikipedia has requests for pictures, so can AI.

3. There's probably going to be more interest in doing more with less, and in training AI on AI output.

Given that AI output is deemed non-copyrightable, the end result is a vast amount of public domain pictures, which can be freely used without paying anyone.

Think it's unlikely? I think it'd very likely based on what happened with the software industry. The industry heavily encouraged Open Source as opposed to Free Software, and as a result today you can easily run a business with zero licensing concerns or payment to anyone.

And once you're there, there's no taking this back. There's not once in my life that I've thought "You know, it's nice that GCC and Clang are free, but I really feel like paying big bucks for a compiler".

> Definitely tech corporations. I'd be careful placing my hopes on copyright.

Yes, I expect you're right.

> [...] Think it's unlikely?

The part of your scenario I'm most skeptical of is the part where artists freely contribute their work to AI training sets en masse. Even putting aside the current (admittedly somewhat reactionary) trend of artists being loudly contemptuous of AI tools and (to a large extent) the people who use them, I don't think the incentives for artists to post their work online are quite the same incentives that drive programmers to open-source their work, and I really don't think having your work subsumed into a vast training set where it completely loses its identity is nearly as satisfying (or professionally enabling, for that matter) as being able to show e.g. that your open-source library gets 100k downloads every day and is a critical component of X, Y, and Z popular software packages. Art—especially art created for non-commercial reasons, which is what we're talking about here—is often very personal, and as a result artists crave direct engagement from the people looking at it.

If the hypothetical "community initiatives" you're proposing can make such contributions more rewarding, either by drastically improving attribution on the output side of the model (seems unlikely) or by offering some other incredible value proposition distinct from the "opportunity" to have one's works blended into the AI slurry, maybe. But if the AI industry and community ever do actually start listening to artists and respecting their intellectual property to the point that they are no longer simply taking what they want just because they can, I think it will be an uphill battle to convince artists to freely give the same without compensation.

It seems more likely to me that large AI companies would need to pay artists to contribute to training sets, making themselves and their peers partially or wholly redundant. I don't think that's great either, but at least it isn't bald-faced theft of intellectual property.

> The part of your scenario I'm most skeptical of is the part where artists freely contribute their work to AI training sets en masse.

You're not quite getting it.

First, there's the public domain. Centuries of art hanging in museums and the like is there for the taking. There's also some modern works there like I think works made for the US government.

Second, there's permissive licensing. For instance I uploaded a bunch of stuff to Flickr that I don't mind at all being used for AI. That's also free for the taking.

Third, if AI output is not copyrightable, AI output can be fed to itself. So lacks in SD can be filled by generating, filtering and reinforcing what you want.

Fourth, if any jurisdiction declares that training is fair game, that can also be added.

Fifth, it occurs to me that if AI output is not copyrightable, then it possibly allows proprietary AIs to be gradually blended in. Eg, say Adobe runs its own. It might own the training set, but if Adobe legally licensed everything, but the output is not copyrighted, then you can feed it to another system still and get some benefit from Adobe's work that way.

This constitutes already a starting set that can be used.

> But if the AI industry and community ever do actually start listening to artists and respecting their intellectual property to the point that they are no longer simply taking what they want just because they can, I think it will be an uphill battle to convince artists to freely give the same without compensation.

So you don't. You don't appeal to traditional artists, you appeal to AI users, who are seeing their cool tools attacked by the artists. You convince them to find more material in a public library, or to make some (since AI output is not copyrighted).

> I don't think that's great either, but at least it isn't bald-faced theft of intellectual property.

I honestly fail to see the long term benefit. I see little satisfaction in "Well, the machine took my job, but at least I fought a legal battle to make sure modern AI is trained from squeaky clean 18th century art".

Once you have a clean AI that's it, that's the end game. You can even sell that as a perk. Why deal with copyright, royalties, and all that nonsense? Here's a machine that will do whatever you want for cheap and won't ever ask for anything.

> Only time will tell whether the inevitable evolution of intellectual property law to account for AI tools favors individual artists or large tech corporations.

In the end, its going to be “large tech corporations” vs. “large media corporations”, with both sides citing individual artists (in one case, their users, and in the other, their suppliers) to present the clash of titans as a David vs. Goliath story with themselves cast as David.

> If you remove artist names from the training data, you lose the ability to describe those styles

Do you?

With base SD1.5, “by greg rutkowski” produces extremely similar results to “detailed realistic muted fantasy”. Except for the case where what you are actually trying to do is directly mimic the work of an artist with a strong unique style that is very heavily represented in the source data (Rutkowski, IIRC, has extremely little representation in the SD training data, leaving aside question of whether the other parts of that description apply), I don’t think you are losing the ability to refer to styles without artist names.