Hacker News new | ask | show | jobs
by cubefox 97 days ago
Linear architectures are at least heavily used in image diffusion models. More so in fact than in language models.