Hacker News new | ask | show | jobs
by acrosync 3300 days ago
Naming chunks by their hashes is not a new idea, but this technique along does not give you a practical backup tool. The deletion of unreferenced chunks becomes a hard problem, and the center piece of lock-free deduplication is the two-step fossil collection algorithm that solves this hard problem.
1 comments

Tahoe-LAFS supports a mark/sweep-style garbage collection algorithm