Hacker News new | ask | show | jobs
by declaredapple 842 days ago
Yeah the output pricing I think is really interesting, 150% more expensive input tokens 250% more expensive output tokens, I wonder what's behind that?

That suggests the inference time is more expensive then the memory needed to load it in the first place I guess?

2 comments

Either something like that or just because the model's output is basically the best you can get and they utilize their market position.

Probably that and what you mentioned.

This. Price is set by value delivered and what the market will pay for whatever capacity they have; it’s not a cost + X% market.
I'm more curious about the input/output token discrepancy

Their pricing suggests that either output tokens are more expensive for some technical reason, or they're trying to encourage a specific type of usage pattern, etc.

Or that market research showed a higher price for input tokens would drive customers away, while a lower price for output tokens would leave money on the table.
> 150% more expensive input tokens 250% more expensive output tokens, I wonder what's behind that?

Nitpick: It's 50% and 150% more respectively.