| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by alexbecker 263 days ago
	I'm working on _prompt injection_, the problem where LLMs can't reliably distinguish between the user's instructions and untrusted content like web search results. Just published a blog post a few minutes ago: https://alexcbecker.net/blog/prompt-injection-benchmark.html

1 comments

Good post. Thanks for sharing. I enjoyed it as much as I enjoyed your anime list. I agree on many.