Y
Hacker News
new
|
ask
|
show
|
jobs
by
Nivge
405 days ago
TL;DR - the benchmark depends on its specific dataset, and it isn't a perfect representation to evaluate AI progress. That doesn't mean it doesn't make sense, or doesn't have value.