Hacker News new | ask | show | jobs
by __jl__ 34 days ago
This understates the cost increase. 3.5 Flash also uses more tokens. artificialanalysis.ai shows these difference to run the whole eval, which I think is more realistic pricing:

Gemini 2.5 flash (27 score): $172 (1.0x)

Gemini 2.5 pro (35 score): $649 (3.8x)

Gemini 3.0 Flash (46 score): $278 (1.6x)

Gemini 3.5 Flash (55 score): $1,552 (9.0x or 2.4x compared to 2.5 pro)

This is a massive price increase... 5.6x compared to Gemini 3.0 Flash

5 comments

At these pricing levels, corporations who use the models will need to ensure employees are using them efficiently. I know, where I work, we don't really think about the cost to the company when using copilot chat, but sounds like it could start adding up really fast, especially for poorly defined questions that have to be revised multiple times.
the era of subsidised ai is ending
API calls have never been subsidized, only subscriptions.
AI is getting really useful, might be why
It's interesting they use output tokens as an eval because all tokens are not made equal. Even from model to model (like Opus 4.6 to Opus 4.7) the tokenizer can be different and it's no longer an apples to apples comparison. No one really talks about this but it directly affects stats like usage limits. Certainly comparing models between providers on an apples to apples comparison token wise is not a good test.
Sonnet-level performance at Haiku prices. They know what they have and who the audience is they want.
Gemini 2.0 Flash: $19
... and you get what you pay for. Or less.