Hacker News new | ask | show | jobs
by VygmraMGVl 19 days ago
Claude opus 4.6 scores 51.9% on the same benchmark. Microsoft's result is quite good.