Hacker News
by CamperBob2 249 days ago
Not only that, but the notion that GPT-5 will answer those questions with only 2% accuracy seems suspect. Those are exactly the kinds of questions that current models are great at.

Nothing about that page makes much sense.

1 comment

The percentages are added, not averaged. Each category sums to 10%, and the General Knowledge category has 5 equally weighted subcategories, so each subcategory is capped at 10% / 5 = 2%. That means 2% is the best possible score you can get in the social science subcategory, not a sign of 2% accuracy.

I don't know why they decided to do it this way. It's very confusing.
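
To make the arithmetic concrete, here is a rough sketch of how I read the scoring. The function names are made up for illustration, and the weights (10 points per category, 5 equal subcategories) are just the figures from this comment, not anything taken from the page's actual methodology.

```python
# Sketch of the additive scoring as I understand it.
# Assumptions: each category is worth 10 percentage points in total,
# and General Knowledge splits that equally across 5 subcategories.
CATEGORY_WEIGHT = 10.0   # percentage points per category (assumed)
N_SUBCATEGORIES = 5      # equally weighted subcategories (assumed)


def max_subcategory_score(category_weight: float, n_subcategories: int) -> float:
    """Largest number of points a single subcategory can contribute."""
    return category_weight / n_subcategories


def reported_score(accuracy: float, category_weight: float, n_subcategories: int) -> float:
    """Points shown for a subcategory, given accuracy in [0, 1]."""
    return accuracy * max_subcategory_score(category_weight, n_subcategories)


def implied_accuracy(reported: float, category_weight: float, n_subcategories: int) -> float:
    """Invert the scaling: what accuracy does a reported score correspond to?"""
    return reported / max_subcategory_score(category_weight, n_subcategories)


cap = max_subcategory_score(CATEGORY_WEIGHT, N_SUBCATEGORIES)
print(f"cap per subcategory: {cap}%")
print(f"accuracy implied by a 2% score: {implied_accuracy(2.0, CATEGORY_WEIGHT, N_SUBCATEGORIES):.0%}")
```

Under those assumptions, a reported 2% in social science is a perfect score for that subcategory, which matches what you'd expect from a current model.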