|
|
|
|
|
by adamgold7
120 days ago
|
|
Prompt guardrails are theater - they work until they don't. We ended up building sandboxed execution for each agent action. Agent proposes what it wants to do, but execution happens in an isolated microVM with explicit capability boundaries. Database writes require a separate approval step architecturally separate from the LLM context. Worth looking at islo.dev if you want the sandboxing piece without building it yourself. |
|