| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by Terretta 582 days ago

> OpenAI's flagship models are not even correct 50% of the time[1]

Where does [1] go? In any case, try Anthropic's flagship:

91% > 50.6%