Hacker News new | ask | show | jobs
by drewnick 402 days ago
I see o3 (high) + gpt-4.1 at 82.7% -- the highest on the benchmark currently.