Hacker News new | ask | show | jobs
by pixel_popping 141 days ago
It's lacking the best model (Opus 4.5) on the benchmark tho.
1 comments

Yeah but then their own product might not score the highest.
Exactly why I'm pointing it out, which feels a bit corrupt, but understandable.
tbh i was a bit cranky yesterday - even if they are #2 on a legit benchmark that would be impressive