Show HN: A Framework for Evaluating Coding Agents on Sequential SWE (arxiv.org)
1 point by tdchaitanya 77 days ago