Hacker News
by alexmorley 187 days ago
Does anyone know where their SWE-bench Verified results are from? I can't find matching results on the leaderboards for their models or the Claude models, and they don't provide any links.