Hacker News new | ask | show | jobs
by adamesque 25 days ago
It's hard to take this piece seriously if he's citing _Ed Zitron's_ math, and equally hard to make the blanket statement that flat-rate plans = "the current AI pricing". But yes, those pricing models were pretty silly and unsustainable.
1 comments

Get back to me when there's an AI company that's actually profitable and we can compare their service and pricing.

Claiming that there's some small subset of their services (like inference per token) that's "profitable" doesn't mean anything when it relies on everything else that company is still paying for. If you could make money from it at current prices - why aren't they?

Otherwise it's just "how much they're willing to subsidize".

On OpenRouter there are 11 third-party inference providers hosting DeepSeek V4 Pro right now, of which 8 are US-based and 7 of those have zero data-retention policies (which I mention to rule out any claims of "oh they're making up the money by logging all your data"). This is a 1.6T-A49B model, so a bit bigger than Sonnet (~2/3rds the size) and a bit smaller than Opus (~3x as large). These third parties are almost perfectly interchangeable via OpenRouter as a marketplace, so they have no incentive to offer any sort of "growth pricing" below costs, and they serve it at $3.48/Mtok out.

Kimi K2.6 is 1T-A32B with a slightly less computationally efficient architecture, and is served at around $3.50/Mtok out by 9 US ZDR providers.

Unless you think that either the generally accepted size estimates for Anthropic/OpenAI models are wildly off or those companies are a lot worse at serving models efficiently, Anthropic and OpenAI are probably making around 5-8x margins on their API costs.

The cost of training new models is of course a major factor not counted here. Depending on how you want to think about that this may or may not make them net profitable. I remember one of those CEOs gave an interview a while back where they described it as a series of independent investments, where each model they train is net-positive in revenue by EOL just from its own inference, but I don't know whether that's still true or not.

Regardless, the point is that if they stopped training new models today, both Anthropic and OpenAI are making incredibly generous profits on their API inference.

The problem with fixating on earnings as you're doing is that it's a bad metric for a growth company. COGS is much more important. What you're doing is setting it up so every growth company is terrible until they've matured into a 20 year old company. That's obviously dumb.
From what I've seen pretty much every company is limited by hardware supply, to the level where's there complaints from current customers about the speed of new customer growth is exceeding their ability to service them properly.

And "growth at all costs" makes sense if there's lock in and you can monetize those "now locked-in users" later - but that doesn't really seem true on the consumer side. It seems pretty trivial to switch out which model and provider on the consumer side.

Any "lock in" has then to be on the model or inference side, and that's still advancing in multiple areas from so many different sources I'm not sure I'm comfortable saying that will also be a "winner takes all" situation either.

My approach is generally "enjoy using it while it's cheap and subsidized, but understand that might not last forever". If it does remain cheap after the subsidies end, great, you can just keep using it. But if it doesn't and you've lost the ability to work without you'll be in for a world of hurt.

Your concern about their business is that demand for their products is growing so stratospherically that they cannot meet that demand easily? I mean that's like an A+ scorecard for any business. Everyone in business would dream of such a scenario. That's called a luxury problem.
But each new customer is still losing money. As I said, subsidized growth only matters if you can recoup those subsidies afterwards - and that's what I'm not sure will be true.

I think the idea of "all growth is good no matter the cost" has been taken to an extreme.

Losing money based on what? COGS? It really sounds like you're hating on a growth company for not being an airliner.
There is probably going to be a quarter or two of profits when the prices dramatically increase. Vibe coding techbros are hooked on the Iron Lung and may not want to get off.

At my work are multiple developers bragging about overnight AI usage to solve problems hands off. Yes they are wasting money and resources but the fad is here. People be vibe coding for now.

In like 6 months when all the costs need to be paid and the prices go up, we will see if these companies stay profitable. But I'm of the opinion that the vibe coding tech bros are more than enough to sustain a short or even medium term profit for these companies. Just on fad-energy alone (see OpenClaw)

The fad probably collapses soon after. I hope anyway, the waste I see is nauseating.

------------

I dunno where this is all going. But I do have faith in human ingenuity still. Things are changing, possibly for the worse, but we need to make the best of it.

The worst of behaviors is wasteful and blatant fraud. There's something useful here though.