Gut feeling is that at least about a month ago OpenAI Codex was better at building complete features than Claude Code. The ExecPlan trick made it work independently for longer periods.
I haven't benchmarked different tools against each other in serious manner.
I haven't benchmarked different tools against each other in serious manner.