Hacker News
by alsima, 684 days ago
Definitely not saying multi-agent setups are all you need for SWE-bench haha. I touch on this at the end of the blog post, where I mention that jumps in progress will require better base models or tooling.