Hacker News new | ask | show | jobs
by aianus 379 days ago
You can easily measure the capacity to think and solve problems with paper-and-pencil exams every week or two not hours of daily busywork.
1 comments

If such tests actually measured the capacity to think and solve problems.

Thing is, LLMs beat humans on the words of such tests (if not the pencil part), and indeed basically all stabdardised tests at every level.