Hacker News new | ask | show | jobs
by alexatkeplar 3111 days ago
We've been doing a lot of thinking about how to support GDPR at Snowplow (Kafka and Kinesis but plenty of other logs and stores) - for our first phase we're just going to support irreversible pseudonymization of tagged PII:

https://github.com/snowplow/snowplow/issues/3472

For later phases, yes user-specific encryption of PII or hashing-with-lookup table are the way to go...

1 comments

I wish you wouldn’t call it irreversible. Every large public claim of that sort has proven false. Consider the Netflix case, where the separate IMDB review dataset allowed reconstruction of pseudonymous movie watching records.

These approaches may help with compliance, but they’re the opposite of real safety.