Hacker News
by dontwearitout 916 days ago
Diffusion models could, hypothetically, be implemented with transformers. What makes diffusion models unique is their training and inference procedure, not the model architecture.
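
To make that concrete, here is a rough sketch of a DDPM-style noise-prediction training loss in plain Python. It is only an illustration under my own assumptions: the names (`alpha_bar`, `diffusion_loss`, `zero_denoiser`, `scaled_denoiser`) are made up for this example, and the linear beta schedule values are just illustrative defaults. The point is that the loss only ever calls `denoiser(x_t, t)`, so a U-Net, a transformer, or anything else with that interface could be plugged in.

```python
import math
import random


def alpha_bar(t, num_steps):
    """Cumulative signal fraction after t noising steps (illustrative linear beta schedule)."""
    prod = 1.0
    for s in range(1, t + 1):
        beta = 1e-4 + (0.02 - 1e-4) * (s - 1) / (num_steps - 1)
        prod *= 1.0 - beta
    return prod


def diffusion_loss(denoiser, x0, num_steps=1000, rng=random):
    """Noise x0 to a random timestep and score the model's noise prediction (MSE).

    `denoiser` is any callable (x_t, t) -> predicted noise; the loss never
    looks inside it, which is why the architecture is interchangeable.
    """
    t = rng.randint(1, num_steps)                 # random timestep
    eps = [rng.gauss(0.0, 1.0) for _ in x0]       # Gaussian noise to add
    a = alpha_bar(t, num_steps)
    x_t = [math.sqrt(a) * x + math.sqrt(1.0 - a) * e for x, e in zip(x0, eps)]
    pred = denoiser(x_t, t)
    return sum((p - e) ** 2 for p, e in zip(pred, eps)) / len(x0)


# Two hypothetical stand-ins for "different architectures".
def zero_denoiser(x_t, t):
    return [0.0 for _ in x_t]


def scaled_denoiser(x_t, t):
    return [0.5 * v for v in x_t]


if __name__ == "__main__":
    x0 = [0.1, -0.3, 0.7, 0.0]
    for model in (zero_denoiser, scaled_denoiser):
        print(model.__name__, diffusion_loss(model, x0, rng=random.Random(0)))
```

Swapping `zero_denoiser` for `scaled_denoiser` changes nothing in `diffusion_loss` itself; in a real setup the same would hold when swapping a convolutional network for a transformer.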