| HN Mirror



	by bbertelsen 96 days ago
	I'd be interested to know when that Opus 4.6 baseline is from, given their recent recognition of performance issues. Do you have a paper posted on this review?

1 comment

ozgune 95 days ago

Ack. I took the benchmark results that the AI labs themselves published for their models. So the Opus 4.6 baseline would be from the time that Anthropic released the model.
