Scale isn’t the only reason. Sometimes you want resource isolation and self-healing, something that is useful if you want a personal swarm of AI agents.
I get how running in a container or vm would help with that, but why would you want to cluster multiple of them? Are you isolating the agents from one another?