Interesting - I've been digging into Claude Code's behavior patterns too, but from the cost side. The variance in how many API calls a task takes is wild. A simple refactoring can be 3 calls or 30 depending on codebase state. Curious what tools it suggested — did any of them relate to reducing unnecessary loops?
I'm never sure whether to believe llms about themselves.
Anthropic create developer containers with a selection of tools installed. Is that a better guide? Some of the config seems aimed at human developers but if claude likes the tools why aren't they in this list?
https://github.com/jahala/tilth