Hacker News new | ask | show | jobs
by afavour 120 days ago
> AI in particular doesn't have a high requirement at the consumer side. Everyone already has everything they need to use it.

That isn’t actually true though, right now everyone has a hard dependency on a cloud service. That is currently sold to them at deep discount by companies that are losing billions.

When the market eventually corrects it’ll be interesting to see how much AI ends up costing. At the very least it will be comparable to the broadband internet connection you mentioned. Possibly a whole lot more.

3 comments

>That isn’t actually true though, right now everyone has a hard dependency on a cloud service. That is currently sold to them at deep discount by companies that are losing billions.

Isn't that a huge red flag? If customers are being given this product at a discount and it still isn't showing a positive ROI for them, what makes people think it will improve once we're charged full price?

I think most people just assume it's magic, and are too awestruck by the hype to think critically.

Financially this feels similar to Uber's business plan in the 2010s; undercut the market with unsound pricing propped up by venture capital (PE was literally subsidising taxi fares; they admitted this and their intention to readjust, but no one seemed to care) then stop manipulating the market and allow fares to even out at (gasp) what it cost to get a cab before Uber.

The difference here is that the LLM market is human productivity; enormous subsidies are afforded to Anthropic, OpenAI etc. in the form of VC or compute credit, but eventually those debts will be called in, the free-to-use aspect will vanish because it's simply not profitable, and we'll be left with several premium products that only a few people will actually pay for, and even then that may not be enough to cover their costs. That's when the bubble will burst.

Actually I think there’s another option.

There’s the scenario where LLMs get more efficient in size, and to get 2026 SOTA performance you will be able to get it from consumer grade laptop.

Sure with a 1000B parameter you will get better performance but the average person will have it write some python script, not derive new physics equations.

So in a sense the demand for LLM intelligence with reach a plateau (arguably we are there today for avg person) so there will not be any subsidy required, because the avg person will not need the latest and greatest.

There’s not the same demand pattern for something like uber.

> There’s the scenario where LLMs get more efficient in size, and to get 2026 SOTA performance you will be able to get it from consumer grade laptop.

But isn't that bad for the AI companies, too? Because then people just run an ~2026 SOTA performance open source model on their laptop for free and not pay any subscription.

Yes and no.

Regular folks will not pay Anthropic, but NSA, NASA or research labs might.

I’m not implying this will be a good time for AI companies. I am saying AI as a technology can provide value without it being controlled by only 3 companies.

In a hypothetical future with 2026 level LLMs on a (high end) consumer laptop, I still think that majority of buyers would prefer to pay 20 USD/month for a service. Just for the convenience and flexibility.
Is it actually being sold at a steep discount? Anthropic CEO has stated they have high margins on inference, so training is the big cost center.
> Anthropic CEO has stated they have high margins on inference, so training is the big cost center.

I'm pretty sure that in corpo-speak "inference" excludes the cost of datacenter construction, GPUs and other hardware, manual data cleaning, R&D, administration, etc - basically everything except the power bill for inference.

I have absolutely no problem with companies that run inference only - plenty of them offer open models as a service - they're usefull and their accounting can be believed... but they don't have near $ Trillion valuations and they don't misallocate capital on a vast scale as the frontier models do.

The point of the OP is that closed models don't pay for themselves and, on the scale of the US economy, they provide minuscule economic advantages compared to the enormous investments they consume.

They've raise 70-ish billion (which they have not spent all of) and have a run rate of 14 billion/y as of now. All said and done those are great economics so far, even accounting for those extra expenses.
Your argument requires the run rate to reduce over time until OpenAI reaches profitability. However, even OpenAI has publicized that they expect their expenses to exponentially increase for their models to remain competitive.

So they are not profitable now & they have no idea of when they ever will be.

Worse, Gemini has guaranteed funding for continued training whenever the AI hype bubble pops.

Anthropic & OpenAI's only saving grace is that Google is generally terrible at product.

> Your argument requires the run rate to reduce over time until OpenAI reaches profitability

I was talking about Anthropic, but run rates don't need to go down, they just need to scale with revenue. For Anthropic specifically, this seems to already be the case.

OpenAI I don't know much about, but it would make sense if they were running at a terrible loss due to the ubiquity of free ChatGPT.

> Worse, Gemini has guaranteed funding for continued training whenever the AI hype bubble pops.

I don't see a scenario in which Anthropic has any problem financing their activity given their conversion rate of inputs to recurring revenue. Generally, bubbles popping means companies with bad balance sheets and bad economics die, but that just doesn't apply to Anthropic IMO.

OpenAI though, hard to say. They've lost all of the good will being the first mover gave them at this point, so they'll need to really lead product to make the economics work for them.

> Is it actually being sold at a steep discount? Anthropic CEO has stated they have high margins on inference, so training is the big cost center.

They're spending more than they're making. For the foreseeable future, saying "we could be profitable if we stopped training" if goofy, because they can't stop. If they do, no one will want to use their product because it will be overtaken by competitors within three months.

I get it that in 10 years all of this might peak and we're gonna be content using old models, but that'll be a very different landscape and Anthropic might not be a part of it anymore if they don't start making money before that.

> I get it that in 10 years all of this might peak and we're gonna be content using old models

I would personally be happy using gpt 5.3 codex for the foreseeable future, with just improvements in harnesses

IMO we're already at the point where even if these company collapse and the models end up being sold at the cost of inference (no new training), we would be massively ahead

That's a perfectly valid approach if you can balance capex and revenue. Why stop and try to be profitable when the economy is giving you the liquidity to push that down the road?

Models are already super useful, but if you can make them more useful by burning cash people are willing to hand you, why not?

Well, training isn't going to end soon if these companies keep on competing with one another whilst being neck-and-neck, so I'm not sure why you would ignore the cost of training in the ROI calculation.
Does the cumulative earnings from inference on a single model exceed its training costs?

That’s.. kinda the question.

Amodei says yes - each model pays for its training. But they're scaling up investment for each new run, so they're still happily in the red.

And also that may be the case for Anthropic who have fewer free users, a large enterprise business, and less generous rate limits on their subscriptions. I don't know if OpenAI or Google have commented. I suspect OpenAI is in a worse position given their massive non-paying consumer base.

Then why are they stopping people from having multiple max plans? If they are making such good margins on inference.
They have good margins on inference at API costs, i.e. $5/$25 per mtok input/output. They are almost certainly making losses on subscriptions, at least if people max out rate limits.

In the past 30 days I have burned $78.19 in API token costs with my $20/month Claude Pro subscription. In January I burnt over $300 in API token costs.

Because the power users of the max plan are subsidized at the upper end of usage by people who don’t approach the per account limit. In other words, the power users are getting more than they pay for, because most people don’t reach that threshold. If you let the power users have dozens of accounts, it has a multiple effect on the proportion of accounts breaching the profitability line.
They are likely aiming to maximize reach/mindshare. Get as many people hooked as possible. More important than minor upside from a few multi-Max users.

EDIT: also, the casual or gym-style members that pay every month but barely use the service are of course very valuable wrt margins

>That is currently sold to them at deep discount by companies that are losing billions.

They're not losing billions on inference, they're losing billions in the arms race of training.