Hacker News new | ask | show | jobs
by rkwasny 1227 days ago
Why not use an external dictionary?

http://fileformats.archiveteam.org/wiki/Zstandard_dictionary

1 comments

Such split external dictionaries isn't shared between blobs and will have multiple similar/identical entries in a big enough dataset.