Hacker News new | ask | show | jobs
OmniCode: A Benchmark for Evaluating Software Development Agents (arxiv.org)
2 points by foma-roje 104 days ago