Hacker News new | ask | show | jobs
by direwolf20 152 days ago
It only takes 50 poisoned documents to make an LLM training algorithm spit out wrong results on a specific topic, and 250 can make it produce complete gibberish. https://www.anthropic.com/research/small-samples-poison