| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by EL_Loco 984 days ago
	I had the same thought. The gothic church one, for example. Why wouldn't I just write "A pink gothic church in the sunset" instead of writing "A gothic church" and then having to do the extra steps to turn the word "church" into pink? Of course, I'm very ignorant of the uses of such tech, so there's probably some usefulness in this.

2 comments

Legend2440 984 days ago

Because at least with current models, the pink-ness would spread to the rest of the image. You'd end up with not only a pink church but a pink sunset.

It's even worse with styles; midjourney can't do a guitar in one style and the rest of the image in another style. You really only get one style per image.

link

90-00-09 984 days ago

The value I see is in constructing more complex prompts. Agree with your example but could see myself using this feature for prompts with multiple objects/aspects that require specific details. Probably not much different from inlining all details, just a nice separation of concerns: you can describe the high level requirement first, and then add and tweak individual details.

link

jbhuang0604 984 days ago

Yes, I think the "footnote" showcases this well. You can use it to interactively explore your visual imagination.

Some examples here: https://youtu.be/ihDbAUh0LXk?si=i3LFfkDXIDKKvne3&t=91

link

90-00-09 983 days ago

Exactly, that's the feature that interested me the most. Ideally, the UI for footnotes would be even more rich: e.g. selecting a word would open a small popup to provide more context.

link

jbhuang0604 982 days ago

Yes! I am particularly excited about this feature.

link