Hacker News new | ask | show | jobs
by cubefox 1004 days ago
I wonder how this compares to GAN based text-to-image models like StyleGAN-T. If I remember correctly, GAN models mainly shine at very fast inference, but the same may not be true for training. Also diffusion based models seem to have generally higher quality.