|
|
|
|
|
by erwannmillon
1048 days ago
|
|
Btw, I did this in pixel space for simplicity, cool animations, and compute costs. Would be really interesting to do this as an LDM (though of course you can't really do the LAB color space thing, unless you maybe train an AE specifically for that color space. ) I was really interested in how color was represented in latent space and ran some experiments with VQGAN clip. You can actually do a (not great) colorization of an image by encoding it w/ VQGAN, and using a prompt like "a colorful image of a woman". Would be fun to experiment with if anyone wants to try, would love to see any results if someone wants to build |
|
A slight nitpick, wouldn't doing diffusion in the latent space be cheaper?