by be7a 57 days ago
242 Elo points clear of the next-best model, with a 93% win rate against random models (96% against nano banana), while Gemini 3.1 (second best) sits at 67%. That’s quite the leap.
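For a sense of scale, here is a rough sketch of what a 242-point gap means head-to-head. It assumes the leaderboard uses the standard Elo logistic curve with a 400-point scale and no ties, which the post doesn't confirm; the function names are just illustrative.

```python
import math


def expected_score(elo_gap: float) -> float:
    """Expected score of the higher-rated side for a given rating gap.

    Assumes the standard Elo logistic curve (base 10, 400-point scale).
    """
    return 1.0 / (1.0 + 10.0 ** (-elo_gap / 400.0))


def elo_gap_for(win_rate: float) -> float:
    """Inverse of expected_score: the rating gap implied by a win rate.

    Only defined for 0 < win_rate < 1.
    """
    return 400.0 * math.log10(win_rate / (1.0 - win_rate))


# Head-to-head expectation for the leader against the next-best model.
print(f"{expected_score(242):.2f}")

# Rating gap that a 93% win rate would correspond to under the same curve.
print(f"{elo_gap_for(0.93):.0f}")
```

Under those assumptions, a 242-point lead works out to roughly an 80% expected win rate against the runner-up specifically. The 93% figure is measured against random opponents, so it implies a larger average gap to the field than to second place.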