Hacker News
by
slopinthebag
80 days ago
Claude Code gets smoked on benchmarks by an agent that has a single tool: tmux. So I think they might actually like that quite a bit.
1 comment
HarHarVeryFunny
80 days ago
What benchmarks are you referring to?