Hacker News new | ask | show | jobs
by npinsker 918 days ago
By its nature, that site isn't very representative of how the models perform in real-world use.
2 comments

That depends on what real world use you're targeting, but unfortunately I'm not aware of anything better than that leaderboard in terms of sample size and model coverage.
The ELO leaderboard you mean?