Hacker News new | ask | show | jobs
by dadie 706 days ago
Under privacy I understand that I have to the limits of my ability control over with whom I share information. But the same is true for anyone sharing information with me. If I have to input any data from someone else to a program without their implicit or explicit permission it breaks, at least in my opinion, their privacy. The same is true if I for example forward an email addressed only to me to another person. I think I can not expect others to at least try to not break my privacy if I break theirs. Though I can not control other people and would never desire to do so. And I may break unknowingly their privacy as they may do with mine. We all make mistakes and modern software has become horrendous in telling the user what it does.

I know that email providers are scanning the received and send emails. But that is something both party involved know (or at least should know) beforehand. This is what I mean with implicit permission. People sending me an email can and must expect that my email provider will scan their emails to a certain degree. They can und must expect, that I will store their email on a local device for an indefinite period. But they can also expect that I will not post their emails somewhere on the web or forward it to someone else. If they desire otherwise they may state their desire and if I agree with them I will try to accommodate for that to the best of my ability.

For most cases I know about in which ai is used there is neither an implicit nor an explicit permission involved and just the assumption that the other side is ok with their data being shared with a 3rd party as long as it is with a machine which may or may not promises to run locally. Having this permission is an assumption I will not make and which I think is wrong.

Hence why in my mind privacy and ai is irreconcilable. Because the only way I can see to have them both is with an incredible unpractical overhead of managing all this implicit and explicit permissions.

Hopes this answers your question. Though I'm not trying to convince you or anyone else.

1 comments

Thank you for replying, but I don't quite understand what you wrote in the context of a writing assistant where there is no sharing in any form with anyone.
Generative LLM's like this often reveal confidential info that they were trained on, and anything provided to them in order to generate other output can be used as future training data which can then be re-iterated to others
The latter can't be true since the input never leaves your computer, it's private information. The comments you're writing right now on the other hand are public information and can and will be used to train an LLM.