Hacker News new | ask | show | jobs
by gwern 3393 days ago
I'm not sure about that. The new GAN models over the past 2-3 months, like LS-GAN or WGAN, all seem to train much more stably. I've beaten up on WGAN with all sorts of strange tweaks and hyperparameter settings and while it may not work well, it's never catastrophically diverged on me the way DCGAN would at the drop of a hat.
2 comments

Have you found any good ways to speed it up? The five-fold training on the Critic is very expensive.
No, not yet. I suspect that increasing the discriminator-only learning rate might help but haven't tried.
Try removing BN from the critic :)