|
|
|
|
|
by viraptor
902 days ago
|
|
> There is no logical meaning to the sum of the size of a compressor and its output that I can see. There is if you have a time limit. Otherwise you could spend a week computing a better result and embed that directly into the compressor - for use only if you're competing that Wikipedia file. |
|
And it doesn't matter if all of Wikipedia is embedded in the compressor — if you can reconstruct the original using only the decompressor and the compressed data, the process can be as expensive as you like.
That's the exciting part.
It's also a vague connection to what's exciting about LLMs, regarding the amount of information they can reproduce with comparatively small size of their weights. But since decompression must be lossless, it's unclear to me if this approach could really help here.
Edit: all of this is already addressed in the FAQ at http://prize.hutter1.net/
Especially the paragraph "Why lossless compression?" is interesting to read.