Hacker News
by ammaox, 230 days ago
A very large review of AI benchmarks that reveals a worrying trend in their effectiveness and scientific rigor