Hacker News
by
luoc
942 days ago
You mean something like a content-defined chunking (CDC) algorithm? I know that some backup tools, such as Restic, use this.
https://en.m.wikipedia.org/wiki/Rolling_hash
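For readers who have not seen CDC before, here is a minimal sketch of the idea, assuming a Gear-style rolling hash rather than whatever Restic actually implements. The names `cdc_chunks`, `GEAR`, `avg_bits`, `min_size`, and `max_size` are hypothetical and the parameter values are picked only to keep the example small.

```python
import random

# Hypothetical lookup table: one pseudo-random 64-bit value per byte value.
# Seeded so that the chunking is deterministic across runs.
_rng = random.Random(0)
GEAR = [_rng.getrandbits(64) for _ in range(256)]
MASK64 = (1 << 64) - 1


def cdc_chunks(data: bytes, avg_bits: int = 6,
               min_size: int = 16, max_size: int = 256) -> list:
    """Split data at boundaries chosen by the content, not by fixed offsets."""
    boundary_mask = (1 << avg_bits) - 1
    chunks = []
    start = 0
    h = 0
    for i, b in enumerate(data):
        # Gear-style update: shifting ages out old bytes, so the low bits
        # of h depend only on the most recent bytes.
        h = ((h << 1) + GEAR[b]) & MASK64
        size = i - start + 1
        at_boundary = size >= min_size and (h & boundary_mask) == 0
        if at_boundary or size >= max_size:
            chunks.append(data[start:i + 1])
            start = i + 1
            h = 0
    if start < len(data):
        # Whatever is left over becomes the final (possibly short) chunk.
        chunks.append(data[start:])
    return chunks
```

Because a cut depends on the last few bytes rather than on the absolute offset, inserting a byte near the start of the input tends to change only the first chunk or two; later chunks usually come out identical and can be deduplicated by their hash.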
EdSchouten
942 days ago
You can use a CDC algorithm, but if you know that duplication mostly occurs at power-of-two boundaries, there is no need for one. Deduplicating on a binary tree basis is sufficient.
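A rough sketch of what binary-tree deduplication could look like, assuming a content-addressed store keyed by SHA-256. The names `store_tree`, `load_tree`, and `leaf_size` are made up for illustration, and an in-memory dict stands in for real storage.

```python
import hashlib


def store_tree(data: bytes, store: dict, leaf_size: int = 4) -> bytes:
    """Store data as a binary tree of content-addressed nodes; return the root key.

    leaf_size is assumed to be a power of two.
    """
    if len(data) <= leaf_size:
        key = hashlib.sha256(b"leaf:" + data).digest()
        store.setdefault(key, data)
        return key
    # Split at the largest power of two strictly below len(data), so that
    # subtrees always start on power-of-two-aligned offsets.
    half = 1 << ((len(data) - 1).bit_length() - 1)
    left = store_tree(data[:half], store, leaf_size)
    right = store_tree(data[half:], store, leaf_size)
    key = hashlib.sha256(b"node:" + left + right).digest()
    # Identical subtrees hash to the same key and are stored only once.
    store.setdefault(key, (left, right))
    return key


def load_tree(key: bytes, store: dict) -> bytes:
    """Reassemble the original bytes from the root key."""
    node = store[key]
    if isinstance(node, bytes):
        return node
    left, right = node
    return load_tree(left, store) + load_tree(right, store)


store = {}
root = store_tree(b"ABCDABCDABCDABCD", store)
print(len(store))  # 3 nodes stored instead of 7: one leaf, one inner node, one root
```

The trade-off is the one described above: this only finds duplicates that line up with the tree's split points. Prepend a single byte to the input and the aligned subtrees no longer match, which is the case CDC is meant to handle.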