Y
Hacker News
new
|
ask
|
show
|
jobs
by
direwolf20
152 days ago
It only takes 50 poisoned documents to make an LLM training algorithm spit out wrong results on a specific topic, and 250 can make it produce complete gibberish.
https://www.anthropic.com/research/small-samples-poison