|
|
|
|
|
by currymj
2853 days ago
|
|
it is weird that everyone uses this relatively old EMD solver. in this case the WGAN doesn’t actually compute the discrete EMD like that. instead it uses some constraints in the optimization process (gradient clipping), which it can be argued make the training objective equivalent in the limit to minimizing continuous Wasserstein distance (between probability distributions). |
|