Hacker News new | ask | show | jobs
by kradroy 2518 days ago
I'm dying for this. My team builds ML models on text corpora. Most of this data is sensitive. My company has very strict data privacy policies and it's a pain to even share the data with other teams in the department. I've made it part of my long-term goals to facilitate secure sharing of sensitive data across the organization. Numerical data seems to be the easiest to anonymize (randomized response, etc), but I have yet to find any techniques for text other than generating synthetic data.
1 comments

Hi, I've been replying to other people in this thread. I work at NuCypher doing some research and cryptography engineering. I work on Proxy Re-Encryption and Fully Homomorphic Encryption.

Do you mind sending me an email with your use case and needs? I'd love to have a chat with you.

john@nucypher.com

This guy right here, this guy knows whats up. gl john.