Hacker News
by Nition 1489 days ago
Nice. The latent-diffusion results have come out very traditional, but the VQGAN/CLIP ones are fairly original.

From my experiments, the LD one doesn't seem to have been trained on as big or as well-tagged a dataset: there's a whole bunch of "in the style of X" prompts that the VQGAN knows* about but the LD doesn't. That might have something to do with it.