Hacker News new | ask | show | jobs
by krasin 441 days ago
They seem to be showing very decent performance results for diffusion transformers. Not so much for the autoregressive transformers (the "regular" ones).