| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by fxtentacle 859 days ago
	"How ML Model Data Poisoning Works" It doesn't. The mentioned Nightshade tool is useless. Does anyone have any example of successful model data poisoning?

1 comments

hikingsimulator 859 days ago

The is a breadth of literature on the topic. I recommend the excellent survey by Baoyuan wu on the topic (mathematical perspective) [1]. For IRL demonstrations, existing cases will of course be rarer, bu they are not impossible as with attacks on Alpaca-7b [2]

[1] https://arxiv.org/abs/2302.09457 [2] https://poison-llm.github.io/

link

fxtentacle 859 days ago

That paper says you need to control "0.1% of the training data size" for a 40% chance for one single injected prompt to fire. So that's millions of images or billions of text tokens for real-world models.

link

talsperre 858 days ago

Exactly. It is very difficult to implement these data poisoning attacks in the wild due to the size of internet data in general.

link

doctorpangloss 859 days ago

Yeah, but the vibes man.

link