Hacker News new | ask | show | jobs
by lordmauve 20 days ago
We need to see DeepSWE scores. SWE Bench Pro is junk.