Hacker News new | ask | show | jobs
A benchmark of expert-level academic questions to assess AI capabilities – HLE (nature.com)
2 points by tufo 110 days ago