by arnaudsm 574 days ago
https://livebench.ai/ does that. The latest GPT-4o significantly underperforms previous versions.