|
|
|
|
|
by whilefalse
110 days ago
|
|
Hey, I made this, thanks for posting! It’s purposefully high level and non-technical for a general audience - my theory was that most people who aren’t into tech/AI don’t care too much about training, or how the system got to be the way that it is. But they do have some interest in how it actually operates once you’ve typed in a prompt. Happy to answer any questions or take on board feedback |
|
Right now we are only seeing the denoising process after it's been morphed by the latent decoder, which looks a lot less intuitive than actual pixel diffusion.
If you can't find a suitable pixel-space model, then you can just trivially generate a forward process and play it backwards.