Hacker News new | ask | show | jobs
by zone411 511 days ago
I just ran my NYT Connections benchmark on it: 18.6, up from 14.8 for Qwen 2.5 72B. I'll run my other benchmarks later.

https://github.com/lechmazur/nyt-connections/