Hacker News new | ask | show | jobs
Psychometrically derived LLM benchmarks: Efficiencies and human-AI comparisons (doi.org)
1 points by alliancedamages 382 days ago