Hacker News new | ask | show | jobs
by agent_invariant 111 days ago
Interesting approach. We ended up framing the problem a bit differently, less as “policy checking” and more as commit control.

Instead of validating the model’s output directly, we assume the model can propose anything. The important part is that real-world state changes can’t execute unless they pass a deterministic boundary.

In our experiments the gate sits between the agent and any irreversible action (DB writes, payments, API mutations). The agent proposes an action, the gate checks invariants like replay, sequencing, ceilings, and context consistency, and only then allows the commit.

The nice side effect is that you get a clean append-only ledger of every proposal, rejection, and commit, which becomes extremely useful when agents misbehave.

Prompt guardrails try to make the model behave better. Execution gates assume it eventually won’t and make sure nothing dangerous happens. Have you seen issues where the policy layer itself becomes complex to maintain as agents operate more varied workflows.