Hacker News new | ask | show | jobs
by wongarsu 781 days ago
Alternative title: AI leaderboards would be useful if they didn't blindly believe the author's benchmarks, included good baselines, and factored in real cost to run the model (parameter count can be misleading). Pareto curves are a good tool to decide which model is the best for a given price/performance tradeoff and should be used more

But that's not quite as catchy. Great article