| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by currymj 2853 days ago
	it is weird that everyone uses this relatively old EMD solver. in this case the WGAN doesn’t actually compute the discrete EMD like that. instead it uses some constraints in the optimization process (gradient clipping), which it can be argued make the training objective equivalent in the limit to minimizing continuous Wasserstein distance (between probability distributions).