by whywhywhywhy 689 days ago
Current-generation image generators don’t understand text as instructions the way you’re trying to use it: describing an object, then placing it, then setting the scene.

It’s more like a giant telescope with many lenses (the latents from the prompts), and you’re adjusting the lenses to bring one of many possible realities into focus.
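
A toy sketch of the analogy, in case it helps. This is not how any real image generator is implemented: the "latents" are random vectors, and `toy_generate`, `strength`, and `steps` are names made up for illustration. It only shows the idea that the prompt gets blended into a single conditioning signal that pulls random noise toward a region, rather than being executed step by step as instructions.

```python
import numpy as np

def toy_generate(conditioning, seed, steps=10, strength=0.1):
    """Toy stand-in for sampling: nudge random noise toward a conditioning vector."""
    rng = np.random.default_rng(seed)
    x = rng.normal(size=conditioning.shape)    # start from pure noise
    for _ in range(steps):
        x = x + strength * (conditioning - x)  # each step closes 10% of the remaining gap
    return x

# Hypothetical "latents" for three prompt fragments (random vectors, not real embeddings).
rng = np.random.default_rng(0)
latents = {word: rng.normal(size=8) for word in ("red", "ball", "beach")}

# The fragments are blended into one conditioning signal ("adjusting the lenses").
# Nothing here places the ball and then sets the scene.
conditioning = np.mean(list(latents.values()), axis=0)

# Same conditioning, different seeds: two different "possible realities" near the same target.
a = toy_generate(conditioning, seed=1)
b = toy_generate(conditioning, seed=2)
```

Changing a prompt fragment shifts `conditioning`, which moves where every seed ends up. In this toy, that is the only control the prompt has.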