|
|
|
|
|
by javcasas
330 days ago
|
|
So the XOR initial trick is: use a hash to partition the data into batches so that each batch has up to 1 missing element. Can't we use this again? I mean: 1. Partition the data so that some batches have up to 1 missing element. 2. Recover the elements where possible with the XOR trick. 3. Pick another hash function, then repeat finding more missing elements. 4. Repeat until no more missing elements. |
|
1. Find a batch with 1 missing element 2. Delete that element from its other assigned partitions 3. Repeat, as the modified batches may now be recoverable
This iterative process (surprisingly!) succeeds with very high probability as long as the number of partitions is 1.22x larger than the number of missing elements with k=3 hash functions.