by Reubend
476 days ago
Super cool, and I'd love to play around with this if they release an open source version. Without a full paper, it's a bit hard to understand the full details. Does this essentially replace nucleus sampling with diffusion, or does it change the "core" transformer architecture in a major way?