Hacker News new | ask | show | jobs
SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via CI (arxiv.org)
2 points by stepri 98 days ago