Hacker News new | ask | show | jobs
SWE-Bench: The $500B Benchmark (marginlab.ai)
5 points by qwesr123 186 days ago