Hacker News new | ask | show | jobs
by ben_w 373 days ago
If such tests actually measured the capacity to think and solve problems.

Thing is, LLMs beat humans on the words of such tests (if not the pencil part), and indeed basically all stabdardised tests at every level.