Y
Hacker News
new
|
ask
|
show
|
jobs
by
sebzim4500
1126 days ago
I'm sure they'd love to have good benchmarks, but there aren't any and realistically if Anthropic invented their own no one would trust it.
1 comments
whimsicalism
1126 days ago
https://lmsys.org/blog/2023-05-10-leaderboard/
link