definitely understand the perspective. I feel like theres some dev work that completely requires me to be single threaded and just think hard, but there other modes where i can jump into whack-a-mole mode and try to work on a bunch of github issues in parallel, or sentry issues, or little ui bugs and the thinking goes more into reviewing plans and prs and i dont have to be as attentive to each individual session
Even with the strongest guardrails, no llm can be left unsupervised. And believing an llm can supervise itself is madness. For every agent you deploy as a “team” you will have multiplied diminishing returns.
I’ve iterated the process hundreds of times and even with a strong specification and tests, an llm can meet the spec and pass all the tests and still build the wrong thing.
This multi-agent workflow idea is worse than web3.