Hacker News new | ask | show | jobs
MLE-Bench: Evaluating Machine Learning Agents on Machine Learning Engineering (openai.com)
3 points by hlynurd 622 days ago