by valianteffort 969 days ago
Did you read what I said? Anything posted on the internet is public. Just because you think it's private does not make it so.

To that end, you should consider any file on your computer to be public as well if it's connected to the internet, as there are countless ways that data could be exfiltrated.

1 comments

That may well be true, but it's not what the article is about. The article is about individuals removing personal data from GenAI products. GenAI companies in many places have a legal responsibility to facilitate this. Your concern about all internet-connected data being "public" is only a concern if you're dealing with malicious actors, which is not what's at hand here.
And like I have been saying, the argument is foolish. You posted something on the internet. That makes it public. It doesn't matter if you thought it was private. Generative AI companies in the US, afaik, do not have any legal responsibility to remove that training data.

There's no difference between an AI being trained on public data and a human being trained on public data. Likewise, there should be no expectation to "unsee" something someone willingly posted in public.