Hacker News new | ask | show | jobs
by volodia 475 days ago
The short answer is that we do more than one parallel pass over multiple tokens: we iteratively refine them over a few passes to fix incoherences. This can be seen as a generalization of diffusion algorithms that underlie systems like Midjourney or Sora.
1 comments

so if I understand correctly, you remask some tokens that were previously unmasked?