| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by cubefox 395 days ago
	Image diffusion models also use transformers nowadays. Here is the original "diffusion transformer" paper: https://arxiv.org/abs/2212.09748 Earlier image diffusion models used U-nets: https://en.wikipedia.org/wiki/U-Net

1 comments

tripplyons 395 days ago

Many U-net based models such Stable Diffusion V1.5 modified the base architecture to include self-attention and cross-attention layers interleaved between convolution layers.

link