FWIW, I chased this demon early-days in my multi-agent orchestrated harnesses. you'll need a state machine. i used locally hosted temporal.io. it's a bit of a PITA to set up but it whips the llama's ass once it's running.
Okey, went through temporal.io and prompted how could it be implemented into our workflow It actually can replace the semaphores, checkpointing, cancellation and even add crash recovery (since it's for free, why not).
But I don't understand what makes it a state machine. Temporal sounds like durable execution enabler. It is better fit to sit on top of our state machine.
But I don't understand what makes it a state machine. Temporal sounds like durable execution enabler. It is better fit to sit on top of our state machine.