| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by drunken_thor 52 days ago
	AI services are only minorly incentivized to reduce token usage. They want high token usage, it makes you pay more. They are going to continually test where the limit is, what is the max token usage before you get angry. All AI companies will continue to trade places for token use and cost as cost increases. We are in tepid water pretending it is a bath pretending we aren’t about to be boiled frogs.

7 comments

jedberg 52 days ago

People said this about AWS too. "Why would they save you money??". It turns out that every time they reduce prices, they make more money, because more people use their services.

AI companies have the same incentive. Make it cheaper and people will use it more, making you more money (assuming your price is still above cost). And of course they have every reason to reduce their on costs.

link

esperent 52 days ago

AWS is notorious for being extremely expensive, so it's not like they became cheap. They just reduce prices from extremely expensive to slightly less extremely expensive, and that makes more people decide to start using it.

Since the price they are charging is still way, way above their operating costs there's no surprise really that they end up making more from small price reductions.

If competition drove them to reduce costs to the point where their operating costs started to be a large factor, the paradox would disappear.

link

zormino 52 days ago

jevons paradox

link

minimaxir 52 days ago

To an extent. That economic incentive stops making sense when a) capacity is an actual constraint and b) Anthropic is not a monopoly and is subject to pressure from competitors who are more user-friendly.

link

estimator7292 52 days ago

I severely doubt it. Token spend translates to real cost for the provider. Each token involves real and expensive compute. They aren't free monopoly money you get billed arbitrarily for. You're paying for electricity and infrastructure involved in generating each token.

Less spend means less real cost to the provider while your flat monthly subscription stays the same price. As well, reducing token use per customer means you can over-subscribe even harder, allowing for more flat monthly subscriptions.

Less tokens = more free capacity = more subscription income.

link

GodelNumbering 52 days ago

I am betting on the fact that people will get increasingly frustrated at closed agent lock-ins. I built (cline fork) and open-sourced https://github.com/dirac-run/dirac with the sole focus on token efficiency expecting that the closed-lock-in vendors will do enough to frustrate their users over time. Looking for contributors

link

y42 52 days ago

That's what I am thinking, too. It sound's like a conspiracy theory, but at the end Anthropic et al benefits from models that don't finish their jobs. I recently read about this "over editing phenomenon". The machine is never done. It doesn't want to.

It's like dating apps. They don't want you to find a good match, because then you cancel the subscription.

link

biglyburrito 52 days ago

Which works fine, right up until China releases a new DeepSeek model that's 85% as capable as an Anthropic or OpenAI premium model but costs a fraction of what either of those US companies are charging.

Speaking of which:

https://www.cnbc.com/2026/04/24/deepseek-v4-llm-preview-open...

link

nananana9 52 days ago

Up to a point. There is incentive when they get to the point where they literally can't serve their userbase and customers start leaving.

link

zzzeek 52 days ago

Well that's why threads like this are important to upvote. On hacker news , they're angry !

link