Hacker News
by slopinthebag 80 days ago
Claude Code gets smoked on benchmarks by an agent that has a single tool: tmux. So I think they might actually like that quite a bit.
1 comment

What benchmarks are you referring to?