| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by chirdeeps 94 days ago

The biggest thing that surprised us: the constraint shifts from intelligence to reliability the moment agents start modifying shared systems. In a PoC, a wrong action is a failed experiment. In production, it's a corrupted customer record, a duplicated invoice, or a deployment that can't be unwound.The specific properties you need before going live that most frameworks don't give you out of the box: 1. Idempotency — can every agent action be safely retried without duplicating side effects?

2. Rollback semantics — if a multi-step workflow fails, what unwinds?

3. Authority boundaries — what can each agent do without human approval, and what requires sign-off?

4. An authoritative action history — when something goes wrong, can you reconstruct exactly what happened and why, without stitching together logs from five different systems?

Most teams discover these requirements after the first production incident. The teams that define them in an execution layer before going live have a much smoother transition. Keen to check out what you will be sharing in the session.