|
|
|
|
|
by fwlr
1211 days ago
|
|
It is so unusual to see clear and precise analysis on this topic. I particularly appreciate this turn of phrase, describing what Stable Diffusion does: “pull from an artist-chosen place in its massive table of probabilities to drive the generation of an image.” |
|
That's not how the diffusion process works. You can pick any number of interesting ways to describe it but if they're technically wrong, it doesn't really matter how poetic they are right?
Diffusion models do use random noise.
As I understand it, every 'step' is composed of three parts: a) the previous output, b) the latent generated from the prompt and c) random noise.
As you move further up, the scheduler changes the weights of a, b, and c that get mixed in.
...but from the article:
> The subtle error comes in a misunderstanding about the "randomly generated noise."
It's not an error. You're just focusing on what you want to focus on.
Let's be 100% blunt: The author of an AI art image is pressing the random generator button. Every time. The output is random.
It's not a matter of debate; the initial seed to the diffusion model is random noise.
The prompt guides the diffusion process, which basically denoises the random noise added to the image certainly... but saying there's no random component to it is completely and utterly wrong.