ProgramBench: Can Language Models Rebuild Programs from Scratch? (github.com)
3 points by fittingopposite 46 days ago
1 comment

I couldn't find the tests. How can we know that the tests are actually reasonable in this case?