|
|
|
|
|
by mrothroc
59 days ago
|
|
I've been running a multi-agent software development pipeline for a while now and I've reached the same conclusion: it's a distributed systems problem. My approach has been more pragmatic than theoretical: I break work into sequential stages (plan, design, code) with verification gates. Each gate has deterministic checks (compile, lint, etc) and an agentic reviewer for qualitative assessment. Collectively, this looks like a distributed system. The artifacts reflect the shared state. The author's point about external validation converting misinterpretations into detectable failures is exactly what I've found empirically. You can't make the agent reliable on its own, but you can make the protocol reliable by checking at every boundary. The deterministic gates provide a hard floor of guarantees. The agentic gates provide soft probabilistic assertions. I wrote up the data and the framework I use: https://michael.roth.rocks/research/trust-topology/ |
|
Small model and (where still required) human in the loop steps for deterministic workflows can solve a surprisingly large number of problems and don't depend on the models to be consistent or not to fail.
Just invest heavily in adversarial agents and quality gates and apply transforms on intermediate artifacts that can be validated for some dimensions of quality to minimize drift.