|
|
|
|
|
by ICBTheory
365 days ago
|
|
Correct: Shannon entropy originally measures statistical uncertainty over a fixed symbol space. When the system is fed additional information/data, then entropy goes down, uncertainty falls. This is always true in situations where the possible outcomes are a) sufficiently limited and b)unequally distributed. In such cases, with enough input, the system can collapse the uncertainty function within a finite number of steps. But the paper doesn’t just restate Shannon. It extends this very formalism to semantic spaces where the symbol set itself becomes unstable.
These situations arise when (a) entropy is calculated across interpretive layers (as in LLMs), and (b) the probability distribution follows a heavy-tailed regime (α ≤ 1).
Under these conditions, entropy divergence becomes mathematically provable. This is far from being metaphorical: it’s backed by formal Coq-style proofs (see Appendix C in he paper). AND: it is exactly the mechanism that can explain the Apple-Papers' results |
|
Separately from that, your entire argument wrt Shannon hinges on this notion that it is applicable to "semantic spaces", but it is not clear on what basis this jump is made.