Hacker News new | ask | show | jobs
by rtrgrd 293 days ago
The blog mentions checking each agent action (say the agent was planning to send a malicious http request) against the user prompt for coherence; the attack vector exists but it should make the trivial versions of instruction injection harder