by brutusurp 1166 days ago
Just tested it out. The Kandinsky 2.1 model does render high-quality images that are accurate to the input prompt. Each iteration of a prompt produces a more robust interpretation, which is exciting. I'm still working through the relationship between the positive and negative prompts.

There is a bit of a gap in understanding metaphor and reverse polarity. For instance, if I include "move away from negativity" in the prompt, Kandinsky produces images of sad faces (e.g., mouth curved down, downward-focused eyes). However, I expected it to infer a "towards positivity" interpretation (e.g., eyes closed or looking forward or upward, and mouth relaxed or smiling). I'm trying to see how far this outcome can be achieved by leveraging the negative prompt field.
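
One way to experiment with this is to pull the "avoid" phrasing out of the main prompt and move the avoided concept into the negative prompt field, so the word "negativity" never reaches the positive prompt at all. Below is a rough sketch of that idea in plain Python. The function name `split_prompt` and the list of trigger phrases are my own assumptions for illustration; nothing here is part of Kandinsky or its tooling, and whether it actually improves the output would need testing.

```python
import re

# Hypothetical trigger phrases that signal "avoid this concept".
# This list is an assumption for illustration; extend it as needed.
AVOID_PATTERN = re.compile(
    r"\b(?:move away from|away from|without|avoid(?:ing)?|no)\s+([^,.;]+)",
    re.IGNORECASE,
)


def split_prompt(prompt: str) -> tuple[str, str]:
    """Split a prompt into (positive_prompt, negative_prompt).

    Each avoided concept is removed from the positive prompt and
    collected into a comma-separated negative prompt.
    """
    avoided = [m.group(1).strip() for m in AVOID_PATTERN.finditer(prompt)]
    positive = AVOID_PATTERN.sub("", prompt)
    # Collapse the empty comma slots left behind by the removal.
    positive = re.sub(r"\s*,\s*(?:,\s*)*", ", ", positive)
    positive = re.sub(r"\s{2,}", " ", positive).strip(" ,.;")
    return positive, ", ".join(avoided)


positive, negative = split_prompt(
    "portrait of a person, move away from negativity, soft light"
)
print("prompt:", positive)
print("negative prompt:", negative)
```

The two strings would then go into the prompt and negative prompt fields respectively. Adding an explicit positive cue (e.g., "calm, smiling") to the first string may still be needed, since removing a concept is not the same as asking for its opposite.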

Overall I'm happy with the results and will continue to use it. Thanks for sharing!