It’s already obvious that it will be a scam. Higher benchmark scores and lower cost are two signs that customers are about to get scammed. We saw it with GPT-5.
This actually proves my point because if you read the anecdotes, you will notice a marked decline in performance. The version number goes up but the actual performance declines. The benchmarks can tell any story you want them to.
Is it? It might turn out to be a scam, but for something to be "obvious" it has to be released first.
There are plenty of ways to reduce inference cost for a high-intelligence model. Making the weights sparser, for example, lets you increase the total parameter count while reducing inference cost and latency, because only a fraction of the weights is used for each token.
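To put rough numbers on the sparsity point, here is a minimal sketch that assumes a mixture-of-experts feed-forward layer as the form of sparsity (my choice for illustration, not something any vendor has confirmed). The layer sizes, the function name `ffn_param_counts`, and the rule of thumb of about 2 FLOPs per active weight per token are all assumptions, and router overhead is ignored.

```python
def ffn_param_counts(d_model: int, d_ff: int, n_experts: int, top_k: int):
    """Return (total, active) weight counts for one feed-forward layer.

    Each expert is modelled as an up-projection plus a down-projection
    (biases ignored). Only `top_k` experts run for any given token.
    """
    per_expert = 2 * d_model * d_ff
    total = n_experts * per_expert   # weights that must be stored
    active = top_k * per_expert      # weights actually used per token
    return total, active


# Hypothetical sizes, chosen only to make the comparison easy to read.
dense_total, dense_active = ffn_param_counts(4096, 16384, n_experts=1, top_k=1)
moe_total, moe_active = ffn_param_counts(4096, 8192, n_experts=16, top_k=1)

# Rough per-token compute: one multiply and one add per active weight.
dense_flops = 2 * dense_active
moe_flops = 2 * moe_active

print(f"total params:    dense={dense_total:,}  sparse={moe_total:,}")
print(f"FLOPs per token: dense={dense_flops:,}  sparse={moe_flops:,}")
```

With these made-up sizes the sparse layer stores 8x the weights of the dense one but spends about half the compute per token, which is the sense in which a bigger model can still be cheaper to serve.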
I think you're driven more by an emotional interest than a technical one here. You've written several posts like this, and many of them make astronomically unlikely predictions.
Ok but didn’t Karpathy make it clear that we live in the vibe era? I’m inclined to trust vibes more than technical jargon, and boy are the vibes off with what’s been happening!
Claude 3 Opus: $15.00 (Input) / $75.00 (Output) per 1M tokens
Claude 4 Opus: $15.00 (Input) / $75.00 (Output) per 1M tokens
Claude 4.1 Opus: $15.00 (Input) / $75.00 (Output) per 1M tokens
Claude 4.5 Opus: $5.00 (Input) / $25.00 (Output) per 1M tokens
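For a sense of what that price drop means per request, here is a quick sketch using only the list prices quoted above. The workload (200,000 input tokens and 50,000 output tokens) and the helper name `request_cost` are made up for illustration, and the calculation assumes plain list pricing with no caching or batch discounts.

```python
# (input, output) list prices in USD per 1M tokens, copied from the list above.
PRICES = {
    "Claude 3 Opus": (15.00, 75.00),
    "Claude 4 Opus": (15.00, 75.00),
    "Claude 4.1 Opus": (15.00, 75.00),
    "Claude 4.5 Opus": (5.00, 25.00),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request at list price."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical workload, not taken from any real usage data.
for model in PRICES:
    cost = request_cost(model, input_tokens=200_000, output_tokens=50_000)
    print(f"{model}: ${cost:.2f}")
```

For that workload the three older models come out at $6.75 and Claude 4.5 Opus at $2.25, so the same job costs a third as much at list price.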