Hacker News new | ask | show | jobs
Show HN: Zip-sizer – A tool to estimate the compressed size of large archives (github.com)
3 points by arunsupe 450 days ago
Introducing zip-sizer, a small CLI, to estimate compressed sizes of directories. It works by choosing a representative sample of the data, compressing it in memory and forming an estimate. It is designed to work on very large archives. Fast and memory-efficient. Estimates for gzip/bzip2 algorithms. Error ~2.5% in my testing.

Feedback welcome.