Hacker News
The Benchmark Saturation Problem: Why AI Evaluation Needs Systems Thinking (distributedthoughts.org)
2 points by TheIronYuppie 273 days ago