by rumblefrog 578 days ago
You can compare models on LM Arena across many categories (e.g. coding), with confidence intervals and user vote counts: https://lmarena.ai/?leaderboard