Y
Hacker News
new
|
ask
|
show
|
jobs
The Benchmark Saturation Problem: Why AI Evaluation Needs Systems Thinking
(
distributedthoughts.org
)
2 points
by
TheIronYuppie
273 days ago