| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by redox99 64 days ago
	> If it slightly beats or even matches Opus 4.6 It doesn't though

1 comments

ryeguy_24 64 days ago

Curious on why you think this. Any data points that led you to this?

link

howdareme 64 days ago

The benchmarks they released

link

johnfn 64 days ago

What do you mean? In most cases, the benchmarks show a larger number for Muse and a smaller number for Opus.

link

spprashant 64 days ago

In Multimodal yes, but Opus is definitely edging out in Text/Reasoning and Agentic benchmarks.

I think the general skepticism is because they are late to race, and they are releasing a Opus-4.6-equivalent model now, when Anthropic is teasing Mythos.

link