Hacker News new | ask | show | jobs
SWE-Bench: Can Language Models Resolve Real-World GitHub Issues? (arxiv.org)
2 points by t0e 974 days ago