Hacker News new | ask | show | jobs
by lazyjeff 5766 days ago
You have a valid point but I'm guessing they avoided kl-divergence because it's 1) harder to explain 2) it's not symmetric, e.g. kl(asian, white) != kl(white, asian) 3) it needs a smoothing function for comparing distributions where not every element is in both distributions.