Hacker News new | ask | show | jobs
by martinald 486 days ago
I find the webdev arena tends to match my experience with models much more closely than other benchmarks: https://web.lmarena.ai/leaderboard. Excited to see how 3.7 performs!