Hacker News new | ask | show | jobs
DeepSWE: Measuring coding agents on original, long-horizon engineering tasks (deepswe.datacurve.ai)
2 points by sss111 24 days ago