|
|
|
|
|
by jbhuang0604
989 days ago
|
|
The UI part is basically a way to organize the user's intention. In the backend, we develop method for extracting "token maps" (i.e., which spatial regions correspond to specific words) and use region-based diffusion to achieve these localized editing results. The second half of the video provides an overview of the method. https://www.youtube.com/watch?v=ihDbAUh0LXk |
|