| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by declaredapple 842 days ago
	Yeah the output pricing I think is really interesting, 150% more expensive input tokens 250% more expensive output tokens, I wonder what's behind that? That suggests the inference time is more expensive then the memory needed to load it in the first place I guess?

2 comments

flawn 842 days ago

Either something like that or just because the model's output is basically the best you can get and they utilize their market position.

Probably that and what you mentioned.

link

brookst 842 days ago

This. Price is set by value delivered and what the market will pay for whatever capacity they have; it’s not a cost + X% market.

link

declaredapple 842 days ago

I'm more curious about the input/output token discrepancy

Their pricing suggests that either output tokens are more expensive for some technical reason, or they're trying to encourage a specific type of usage pattern, etc.

link

brookst 842 days ago

Or that market research showed a higher price for input tokens would drive customers away, while a lower price for output tokens would leave money on the table.

link

BeetleB 842 days ago

> 150% more expensive input tokens 250% more expensive output tokens, I wonder what's behind that?

Nitpick: It's 50% and 150% more respectively.

link