Hacker News new | ask | show | jobs
by alexmorley 187 days ago
Does anyone know where their SWE-bench Verified results are from? I can't find matching results on the leaderboards for their models or the Claude models and they don't provide any links.