Hacker News new | ask | show | jobs
by aDyslecticCrow 473 days ago
This is simply brilliant.

The moment I heard the synopsis of the technique, I thought of one thing: Style transfer.

This model style should be really nice for translation and style transfer tasks. Takes an existing section text, noises it, and reverses it with guidance like an image diffusion model; A "movement" in latent with a controllable amount of modifications.

The diffusion process enables a wide range of "control" approaches not possible with current transformer models. Perhaps summarizing text can be done differently as well, taking an input and diffusing it into a shorter and shorter section.

I've not been this hyped about a new method since GPT3 itself.