| HN Mirror

	by throwawai123 457 days ago
	I am one of the co-authors of the original AgentDojo benchmark, developed at ETH. Agent security is indeed a very hard problem, but we have found it quite promising to apply formal methods such as static analysis to agents and their runtime state [1], rather than just scanning for jailbreaks.

	[1] https://github.com/invariantlabs-ai/invariant?tab=readme-ov-...
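
	To make the contrast with jailbreak scanning concrete, here is a minimal sketch of a flow check over an agent's tool-call trace. It is only an illustration and is not Invariant's actual API: the `ToolCall` type, the tool names, and the `find_violations` helper are all hypothetical. The idea is to inspect what the agent did (its runtime state) rather than pattern-match the prompt text.

```python
from dataclasses import dataclass


@dataclass
class ToolCall:
    """One step in an agent trace (hypothetical structure for illustration)."""
    name: str
    args: dict


# Assumed policy: tools that bring in untrusted content, and tools that can leak data.
UNTRUSTED_SOURCES = {"fetch_url", "read_inbox"}
SENSITIVE_SINKS = {"send_email", "post_message"}


def find_violations(trace):
    """Return (source_index, sink_index) pairs where a sensitive sink
    is called at any point after the first untrusted source."""
    violations = []
    first_source = None
    for i, call in enumerate(trace):
        if call.name in UNTRUSTED_SOURCES and first_source is None:
            first_source = i  # remember where untrusted data first entered
        elif call.name in SENSITIVE_SINKS and first_source is not None:
            violations.append((first_source, i))
    return violations


trace = [
    ToolCall("fetch_url", {"url": "https://example.com"}),
    ToolCall("summarize", {}),
    ToolCall("send_email", {"to": "someone@example.com"}),
]
print(find_violations(trace))  # [(0, 2)]
```

	This rule is deliberately coarse (it flags any sink after any untrusted source, with no real taint tracking), but it shows the shape of a trace-level check: the decision depends on the sequence of actions, so it still fires when the injected text looks nothing like a known jailbreak.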