Hacker News new | ask | show | jobs
by SeriousM 54 days ago
The ranking of gold medals only makes sense if all models would gave participate all tests.

DNP = Did not participate

In this regard, kimi got more and better medals than Claude.