Hacker News
by azinman2 587 days ago
Leaderboards are misleading. Try different models on YOUR task and you’ll see the outputs vary widely from what the “official” rankings would suggest.
1 comment

Ok, maybe I haven't experimented enough; so for which tasks is Gemini the SOTA?