Hacker News new | ask | show | jobs
by ammaox 230 days ago
A very large review of AI benchmarks that reveals a worrying trend in their effectiveness and scientific rigor