Hacker News
by alexpop80 164 days ago
What do you mean? Opus 4.5 and GPT 5.2 broke the 80% mark, and no other models seem to have passed this important milestone yet.