by himata4113
9 days ago
Anthropic and OpenAI serve the most tokens per unit of compute of anyone on the planet, and honestly that's their current moat. They're able to serve tokens at half the cost of any open-source provider. Here are the costs to serve Opus 4.7 in China on AWS, according to one of my connections who operates an enterprise account in the region:

Input: $0.257
Output: $1.286
Cache read: $0.0257
Cache write: $0.322

And I have zero doubt that, with batching and other optimizations, subscription users are being served at an even lower cost. Most of their expenses likely come from training, as we're far into diminishing-returns territory. We will know once Anthropic is required by law to report these numbers, so there's no point in continued speculation that "Anthropic is losing $9 for every $1", because 1: unless there are some subsidies going on, it's not true, and 2: we will be told the numbers directly by Anthropic in the near future.
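
For a sense of scale, here is a rough sketch that plugs the four rates quoted above into a per-request cost calculation. The unit is an assumption on my part: the figures are treated as USD per one million tokens, which the quoted numbers do not actually state. The function names and the example token counts are made up for illustration.

```python
# Rates exactly as quoted in the comment; the source gives no unit for them.
RATES_USD = {
    "input": 0.257,
    "output": 1.286,
    "cache_read": 0.0257,
    "cache_write": 0.322,
}

# ASSUMPTION: each rate is per one million tokens. This is not stated in the source.
TOKENS_PER_RATE_UNIT = 1_000_000


def request_cost(tokens: dict[str, int]) -> float:
    """Return the USD cost of one request, given token counts keyed like RATES_USD."""
    return sum(
        RATES_USD[kind] * count / TOKENS_PER_RATE_UNIT
        for kind, count in tokens.items()
    )


def ratios_to_input() -> dict[str, float]:
    """Express every rate as a multiple of the input rate (independent of the unit)."""
    return {kind: rate / RATES_USD["input"] for kind, rate in RATES_USD.items()}


if __name__ == "__main__":
    # Hypothetical request: small fresh prompt, large cached context, short reply.
    example = {"input": 2_000, "cache_read": 50_000, "output": 1_000}
    print(f"cost: ${request_cost(example):.6f}")
    for kind, ratio in ratios_to_input().items():
        print(f"{kind}: {ratio:.2f}x input")
```

Whatever the real unit is, the ratios hold: output comes out at roughly 5x the input rate, cache reads at 0.1x, and cache writes at roughly 1.25x.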