Hacker News new | ask | show | jobs
by jbhuang0604 989 days ago
The UI part is basically a way to organize the user's intention. In the backend, we develop method for extracting "token maps" (i.e., which spatial regions correspond to specific words) and use region-based diffusion to achieve these localized editing results.

The second half of the video provides an overview of the method. https://www.youtube.com/watch?v=ihDbAUh0LXk

1 comments

TIL about region-based diffusion. Thanks for the context!