Y
Hacker News
new
|
ask
|
show
|
jobs
by
martinald
486 days ago
I find the webdev arena tends to match my experience with models much more closely than other benchmarks:
https://web.lmarena.ai/leaderboard
. Excited to see how 3.7 performs!