Opus 4.7 has all the signs of a smaller model distilled from a newer pretraining run... except a smaller price.
Flash 3.5 raised in price pretty meaningfully over Flash 3
GPT 5.4 got a small price bump over gpt-5.3-Codex/gpt-5.2, then gpt-5.5 doubled pricing over gpt-5.4
Even open weights isn't immune: Kimi K2.6 was originally priced higher despite openly being 2.5 + more post-training, same with GLM 5.1 vs 5
-
All while rental prices are spiking month over month, and NVIDIA Inception discounted prices for buying are higher than undiscounted prices for buying 6 months ago...
I run a consumer AI product and the current reality of trying to get compute vs what it was 6-12 months ago is enough to justify it to anyone who has the money.
I think OpenClaw created a mania that was completely unfounded (Apple Silicon is worth dirt compared to literally anything from NVIDIA including consumer GPUs), but the prediction of compute becoming scarce was correct
Not necessarily. The GPU leases Spacex has made are month to month, so they are taking on all of the risk. If demand goes down, they're the ones stuck with the assets.
Opus 4.7 has all the signs of a smaller model distilled from a newer pretraining run... except a smaller price.
Flash 3.5 raised in price pretty meaningfully over Flash 3
GPT 5.4 got a small price bump over gpt-5.3-Codex/gpt-5.2, then gpt-5.5 doubled pricing over gpt-5.4
Even open weights isn't immune: Kimi K2.6 was originally priced higher despite openly being 2.5 + more post-training, same with GLM 5.1 vs 5
-
All while rental prices are spiking month over month, and NVIDIA Inception discounted prices for buying are higher than undiscounted prices for buying 6 months ago...