|
|
|
|
|
by paxys
211 days ago
|
|
I'm not quite convinced. You're telling the agent "implement what it says on <this blog>" and the blog is malicious and exfiltrates data. So Gemini is simply following your instructions. It is more or less the same as running "npm install <malicious package>" on your own. Ultimately, AI or not, you are the one responsible for validating dependencies and putting appropriate safeguards in place. |
|
> Given that (1) the Agent Manager is a star feature allowing multiple agents to run at once without active supervision and (2) the recommended human-in-the-loop settings allow the agent to choose when to bring a human in to review commands, we find it extremely implausible that users will review every agent action and abstain from operating on sensitive data.
It's more of a "you have to anticipate that any instructions remotely connected to the problem aren't malicious", which is a long stretch.