Hacker News
by drewnick, 402 days ago
I see o3 (high) + gpt-4.1 at 82.7% -- the highest on the benchmark currently.