Hacker News new | ask | show | jobs
by Vuizur 788 days ago
It is impossible to remove personal data ("any information which are related to an identified or identifiable natural person") from the LLM training data.

As far as I understand it ChatGPT and all other similar systems are blatantly violating GDPR, they would have to for example publish their related training data to conform.

I guess the EU authorities don't do anything for now because they don't want to admit that their funny law basically bans all state-of-the-art AI.

(Ok, Openai also broke the law in almost all countries by downloading shadow libraries, but here they at least have more plausible deniability.)