Hacker News new | ask | show | jobs
by smartbit 7 days ago
> MiMo V2.5 Pro ... lower cached price

At the moment of writing https://news.ycombinator.com/item?id=48343690 MiMo V2.5 Pro had a lower cache hit ratio. From the article:

OSS models, depending on who you use them from, make a huge difference, mostly due to cache-hit rates.

  Model                   Cheapest effectiveInputPrice (Provider)  
  MiMo-V2.5-Pro           0.3720 (Xiaomi) 
  DeepSeek V4 Pro (Max)   0.0560 (DeepSeek)
1 comments

Could it be that it changed recently, or am I missing something? Both prices are the same https://openrouter.ai/compare/xiaomi/mimo-v2.5-pro/deepseek/...

EDIT: okay I misread it, does this mean that DeepSeek reuses a higher percentage of tokens at cache price that MiMo, am I right?

Correct. According to https://minimaxir.com/2026/05/openrouter-hy3/#llm-economics-... [0] when served by DeepSeek, Cache Read Costs/Input Costs are a very low percentage:

  DeepSeek V4 Pro    0.83%
  DeepSeek V4 Flash  2%
Notice that OpenRouter response caching is not available when account-level ZDR is enforced [1]

[0] https://news.ycombinator.com/item?id=48317294#48317823 [1] https://openrouter.ai/docs/guides/features/response-caching#...