Hacker News new | ask | show | jobs
by yousif_123123 723 days ago
One downside for diffusion based systems (and I'm very noob in this) is that the model won't be able to see it's input and output in the same space, therefore wouldn't be able to do follow-up instructions to fix things or improve on it. Where as an LLM generating html could follow instructions to modify it as well. It's input and output are the same format.
1 comments

Oh? I would think that the input prompt to drive generation is not lost during generation iterations -- but I also don't know much about the architectural details.