| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by gcr 3439 days ago
	GAN training is still spooky mysterious and can easily fail in nonintuitive ways. Sometimes GANs converge or not depending on the random number seed, even with the same hyperparameters.

1 comments

gwern 3439 days ago

I'm not sure about that. The new GAN models over the past 2-3 months, like LS-GAN or WGAN, all seem to train much more stably. I've beaten up on WGAN with all sorts of strange tweaks and hyperparameter settings and while it may not work well, it's never catastrophically diverged on me the way DCGAN would at the drop of a hat.

link

AlexCoventry 3439 days ago

Have you found any good ways to speed it up? The five-fold training on the Critic is very expensive.

link

gwern 3438 days ago

No, not yet. I suspect that increasing the discriminator-only learning rate might help but haven't tried.

link

igul222 3439 days ago

Try removing BN from the critic :)

link