Hacker News
by
im3w1l
1520 days ago
It can be related to compression. If some phrase has a probability p_i of occurring, then the optimal length for its code is -log(p_i) (use log base 2 to get the length in bits). The entropy sum(-p_i log(p_i)) = mean(-log(p_i)), where the mean is weighted by the probabilities p_i, is the average code length you will use.
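To make that concrete, here is a minimal sketch in Python. The function names and the example distribution are made up for illustration, and it assumes every probability is strictly positive and that lengths are measured in bits (log base 2).

```python
import math

def optimal_code_lengths(probs):
    """Ideal code length -log2(p_i), in bits, for each probability.

    Assumes every p_i > 0 (log2(0) is undefined).
    """
    return [-math.log2(p) for p in probs]

def entropy(probs):
    """Probability-weighted mean of the ideal code lengths: sum(-p_i * log2(p_i))."""
    return sum(p * length for p, length in zip(probs, optimal_code_lengths(probs)))

# Hypothetical distribution over four phrases (illustrative values only).
probs = [0.5, 0.25, 0.125, 0.125]

lengths = optimal_code_lengths(probs)
avg_bits = entropy(probs)

print(lengths)   # [1.0, 2.0, 3.0, 3.0]
print(avg_bits)  # 1.75
```

With these particular probabilities (all powers of 1/2) the ideal lengths come out as whole numbers, so a prefix code such as 0, 10, 110, 111 hits the entropy exactly. For other distributions -log2(p_i) is generally fractional, and a real code with whole-bit codewords can only approach the entropy.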