by scratchyone 409 days ago
I wonder if it benefits because it can attend to individual tokens of the prompt while generating, compared to typical diffusion models that just get a static vector embedding of the prompt.
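To make the distinction concrete, here is a toy NumPy sketch, not any particular model's implementation. It contrasts collapsing the prompt into one fixed vector with letting each generation step attend over the individual prompt tokens. The function names (`pooled_conditioning`, `token_attention`), the mean pooling, and the one-hot "embeddings" are all assumptions made purely for illustration.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D score vector.
    e = np.exp(x - x.max())
    return e / e.sum()

def pooled_conditioning(prompt_tokens):
    # Hypothetical "static vector" conditioning: the whole prompt is
    # collapsed into one embedding (mean pooling is an assumption here).
    return prompt_tokens.mean(axis=0)

def token_attention(query, prompt_tokens):
    # Scaled dot-product attention of one generation step over every
    # prompt token, so each step can weight the tokens differently.
    d = prompt_tokens.shape[-1]
    weights = softmax(prompt_tokens @ query / np.sqrt(d))
    return weights @ prompt_tokens, weights

# Toy prompt: 5 tokens with orthogonal 8-dim embeddings (illustrative only).
prompt_tokens = np.eye(5, 8)

# Two different generation steps, each "looking for" a different token.
q1 = 4.0 * prompt_tokens[0]
q2 = 4.0 * prompt_tokens[3]

pooled = pooled_conditioning(prompt_tokens)   # identical at every step
ctx1, w1 = token_attention(q1, prompt_tokens)  # leans toward token 0
ctx2, w2 = token_attention(q2, prompt_tokens)  # leans toward token 3
```

The pooled vector is the same no matter which step is running, while the attention context changes with the query. That is the kind of per-token access I'm speculating about.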