by ninininino 49 days ago

Yes, I think it's hard.

OpenAI has already been proven to be easily gamed through very unsophisticated poisoning (fake information on a web page plus an edit to a wiki page pointing at it, or fake information in a Reddit post), so I'm not sure we should hold up their data-cleaning efforts as a gold standard.

https://www.sei.cmu.edu/blog/data-poisoning-in-ai-models-the...