| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by lelanthran 76 days ago
	> I easily get $1K+ of usage out of my $100 max sub. And that's with Opus 4.6 on high thinking. And people keep claiming the token providers are running inference at a profit.

4 comments

gruez 76 days ago

>And people keep claiming the token providers are running inference at a profit.

Not everyone gets $1K of usage, and you don't know how fat the per-token margins are. It's like saying the local buffet place is losing money because you eat $100 worth of takeout for $30.

link

lelanthran 76 days ago

> Not everyone gets $1K of usage, and you don't know how fat the per-token margins are.

Well, we're going to find out sooner rather than later. Right now you don't know how thin (or negative) the margins are, either, after all.

All we know for certain is how much VC cash they got. Revenue, spend, profit, etc calculated according to GAAP are still a secret.

link

manquer 76 days ago

In addition to usage distribution aspects others called out .

$1K is not actual cost, just API pricing being compared to subscription pricing. It is quite possible that API has a large operating margins, and say costs only $100 to deliver $1K worth of API credits.

link

Computer0 76 days ago

The model developers across the board stand by that most/all models are profitable by EOL, and losses come from R&D/Training.

link

infecto 76 days ago

Yes and when we say things like that we are not talking about plans. Running inference at a profit means api token use is run profitably. It’s a huge unknown what’s happening at the plan level, we know there is subsidy happening but in aggregate impossible to know if it’s profitable or not.

link