Hacker News new | ask | show | jobs
by jdlyga 35 days ago
It's a bit odd that they're not comparing it against Sonnet
2 comments

I don't think so. They're comparing it to the highest tier available models from Anthropic and OpenAI. Generally speaking, Opus is better than Sonnet in almost every way, so why have the redundancy?
Price to performance?
I think their comparison to how their benchmarks compare to Opus are a great way to show "look at similar benchmarks for a fraction of the cost". If it has Opus benchmarks (I don't actually take benchmarks seriously, but for their comparison purposes) and Sonnet is still more than half the price of Opus, I figure it's close enough where it doesn't matter.
The tweet specifies that the new model is geared towards long-running tasks, which is what you'd use a model like Opus for anyway.