Hacker News new | ask | show | jobs
by minimaxir 966 days ago
Giving Stable Diffusion constraints forces it to get creative.

It’s the best argument against “AI generated images are just collages”.

1 comments

This is a general result. For example, ChatGPT struggles hard with following lexical, syntactic, or phonetic constraints in prompts due to the tokenization scheme - see https://paperswithcode.com/paper/most-language-models-can-be...

LLMs + Diffusors are super charged when using techniques like constraints, controlnet, regional prompting, and related techniques.