| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ryeguy_24 112 days ago
	Curious on why you think this. Any data points that led you to this?

1 comments

howdareme 112 days ago

The benchmarks they released

link

johnfn 112 days ago

What do you mean? In most cases, the benchmarks show a larger number for Muse and a smaller number for Opus.

link

spprashant 112 days ago

In Multimodal yes, but Opus is definitely edging out in Text/Reasoning and Agentic benchmarks.

I think the general skepticism is because they are late to race, and they are releasing a Opus-4.6-equivalent model now, when Anthropic is teasing Mythos.

link