| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by gmerc 137 days ago
	This betrays a lack of understanding how inference works. You cannot categorically defeat prompt injection with instructions. It does not work. There are no privileged tokens.

1 comments

lmeyerov 137 days ago

Yep! One of my favorite attacks is just having a very long piece of a text so the LLM becomes unclear what's important and is happy to do something else

link