|
|
|
|
|
by ogogmad
1520 days ago
|
|
One of the properties of entropy H(X) of a random variable X is that if f is a bijective function then H(f(X)) = H(X). For relative entropy (or "KL divergence" as some people call it), we have that H(X||Y) = H(f(X)||f(Y)). But if you fix Y to have a continuous uniform distribution, then you lose this critical property because f(Y) may no longer have a continuous uniform distribution. |
|
Why would they care about arbitrary transformations mapping points in the space to other points in the space?