I don't understand - are people's agents making so many mistakes? I'm using VSCode + Cline + Mimo to refactor big codebases and add features (including payment integrations) and it's rarely making any mistakes.
I use Claude Opus 4.7 on max thinking inside Claude Code and I gotta tell you, as context of the project grows, it starts slipping. No amount of whipping and cursing has helped.
Currently looking to start making my own hooks setup so it can be safer but nothing concrete yet.
Currently looking to start making my own hooks setup so it can be safer but nothing concrete yet.