Hacker News new | ask | show | jobs
by burmecia 2542 days ago
I don't think the encrypt-then-dedup is a safe way to protect data privacy. In this case, identical blocks need to produce same cipher text, this will actually leak your data pattern even though it is encrypted. A better way I think is using randomly-seeded derived keys to encrypt each block, thus the identical blocks' cipher text will always be different.
1 comments

Yes, if that's more important to you than dedup savings, then you should definitely do that.