|
|
|
|
|
by willvarfar
1067 days ago
|
|
An LLM or, well, any statistical model, is about prediction. As in, given some preceding input, what comes next? One way to measure the accuracy of the model, as in it's "intelligence", is to use the predictions to turn input into all the differences from the prediction; if it's good at predicting then there will be fewer differences and it will compress it. So seeing how well your model can compress some really big chunk of text is a very good objective measure of it's strength and compare it to the strength of others? So a competition is born! :) |
|
The LLM vs a static tree has some interesting oppositions. With a static tree as emitted by a compression alg will probably many times beat an LLM. As it has full knowledge of the whole stream (or in gzips version that window). So it can do things where it can look back and say 'hm the tree I spit out was not that good let me build a better one'. Where as an LLM does not really have that before hand knowledge. Using a pre-cooked LZW tree for all inputs would be more akin to using an LLM.