Hacker News
by viraptor 128 days ago
Why not just check on your real tasks? I'm quite happy with the k2.5 and glm5 performance in practice. Whether they also gamed the benchmarks is not as relevant.