|
|
|
|
|
by netcoyote
94 days ago
|
|
> What's stopping your agent from overwriting an arbitrary source file (e.g. index.js) with arbitrary code and running it? You're absolutely right :) And even if it could be sandboxed at the source code level, what's to prevent a nefarious AI from writing an executable file directly as bytes that calls (e.g.) `unlink`? |
|
But yeah, a truly adversarial agent needs a sandbox. It's a different threat model - nah is meant to catch the trusted but mistake-prone coding CLI, not a hostile agent.