Hacker News
by mhi3 55 days ago
"Published benchmarks are gamed, optimized, and overfit, and no longer yield a useful signal."

Is this true?

But I love this concept!

1 comment

Oh, very true. Benchmaxxing itself is basically gaming them.