by Kim_Bruning 1212 days ago
At any rate, "elephant with blue skin" is a fairly basic prompt. You can go into excruciating detail with prompts spanning many paragraphs, specifying lighting, perspective, positioning, colors, clothing, styles, patterns, etc.

Within a given domain, if you are very exacting with your prompting and start adding weights, you can e.g. create portraits of your friends, or of characters you've imagined, or basically make anything that you can either see or imagine.

There's a new problem though. You can easily make a picture of 1 thing/person/concept, but if you have more than one, they share the prompt.

Say you have an ever so slightly modded Stable Diffusion checkpoint, and say you want a picture of a

  1boy with short_hair, 1girl with (long_hair:1.3)

Now these instructions are fighting each other. If you run a batch, you'll get

* a few images with what you thought you wanted (a boy with short hair and a girl with long hair),

* in some cases you'll get a boy and a girl both with long hair (because the long_hair:1.3 overrides the short_hair),

* and in some cases you'll get a single person, who happens to have long hair (because of the long_hair:1.3), and who is more likely to be female (because of implication: long hair was more commonly associated with females in the original dataset).
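To make the "shared prompt" problem concrete, here is a rough Python sketch that splits the prompt above into weighted spans. It assumes only the simple `(text:weight)` form; the regex and the `parse_prompt` helper are my own illustration, not the code of any particular UI, and real parsers handle nesting and escapes that are ignored here.

```python
import re

# Matches the simple "(text:weight)" emphasis form only (an assumption;
# nesting and escaped parentheses are deliberately not handled).
WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")

def parse_prompt(prompt: str) -> list[tuple[str, float]]:
    """Split a prompt into (text, weight) spans; unweighted text gets 1.0."""
    spans = []
    pos = 0
    for match in WEIGHTED.finditer(prompt):
        plain = prompt[pos:match.start()].strip(" ,")
        if plain:
            spans.append((plain, 1.0))
        spans.append((match.group(1).strip(), float(match.group(2))))
        pos = match.end()
    tail = prompt[pos:].strip(" ,")
    if tail:
        spans.append((tail, 1.0))
    return spans

PROMPT = "1boy with short_hair, 1girl with (long_hair:1.3)"
for text, weight in parse_prompt(PROMPT):
    print(f"{weight:.1f}  {text}")
```

The result is a flat list: one span at weight 1.0 and `long_hair` at 1.3. Nothing in that structure says the long hair belongs to the girl rather than the boy, which is the ambiguity the batch results above show.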

Of course, if you're smart, you might think to exploit this ambiguity on purpose.

On the gripping hand, this is obviously insufficient control.

So now we're getting newer systems that allow you to chain/layer prompts and apply different prompts with different strengths to different regions of your illustration. This means that you can have e.g. your male character on one side, the female character on the other side, and deal with things like room decoration, the view from the window, and specific props - all with separate prompts.
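As a toy illustration of the idea (not how any specific tool does it), you can think of each prompt as owning a region plus a strength, and of every pixel column as getting a normalized mix of the prompts that cover it. `Region` and `column_weights` are hypothetical names I made up for this sketch, and the regions are plain vertical strips to keep it short.

```python
from dataclasses import dataclass

@dataclass
class Region:
    prompt: str
    strength: float
    x0: float  # left edge, as a fraction of image width
    x1: float  # right edge (exclusive), as a fraction of image width

def column_weights(regions: list[Region], width: int) -> list[list[float]]:
    """For each pixel column, return normalized per-region blend weights."""
    table = []
    for col in range(width):
        x = (col + 0.5) / width  # column centre, in [0, 1)
        raw = [r.strength if r.x0 <= x < r.x1 else 0.0 for r in regions]
        total = sum(raw)
        table.append([w / total if total else 0.0 for w in raw])
    return table

regions = [
    Region("room interior, view from the window", 0.5, 0.0, 1.0),  # everywhere
    Region("1boy with short_hair", 1.0, 0.0, 0.5),                 # left half
    Region("1girl with (long_hair:1.3)", 1.0, 0.5, 1.0),           # right half
]
weights = column_weights(regions, width=8)
print(weights[0])  # leftmost column: background and boy only
print(weights[7])  # rightmost column: background and girl only
```

With this layout the long_hair weight never touches the left half at all, so the two character prompts stop fighting each other. What a real tool actually blends with these weights, and at which step, varies and is left out here.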

And these are just applications of txt2img. There are also things like img2img, inpainting, and... well... new tools are showing up every week, it seems.

If this hasn't crossed back into the creativity sphere already, it probably will by next month, or else the month after (in a manner of speaking).