Hacker News new | ask | show | jobs
by planck 6246 days ago
This is also known as the ZIP compression algorithm.
2 comments

But then you might end up not being able to see the forest of data for all the Huffman trees.
It's called "deduplication" in databaseland, it's common strategy for tape backups.