Hacker News
by energy123 40 days ago
Arena only allows very small context sizes, so it's a noisy benchmark for what we care about IRL.