Hacker News new | ask | show | jobs
GPQA and HLE are broken (zenodo.org)
2 points by whwhyb 153 days ago
1 comments