Hacker News new | ask | show | jobs
Measuring AI Ability to Complete Long Software Tasks (muratbuffalo.blogspot.com)
2 points by KraftyOne 56 days ago