Hacker News
by amunozo 16 days ago
DeepSeek V4 Pro is wonderful and ridiculously cheap, but we are sleeping on MiMo V2.5 Pro, which has the same price (and a lower cached price), is multimodal, and ranks higher in most benchmarks. The same goes for MiMo V2.5 vs DeepSeek V4 Flash.
2 comments

> MiMo V2.5 Pro ... lower cached price

At the time https://news.ycombinator.com/item?id=48343690 was written, MiMo V2.5 Pro had a lower cache hit ratio. From the article:

> OSS models, depending on who you use them from, make a huge difference, mostly due to cache-hit rates.

  Model                   Cheapest effectiveInputPrice (Provider)  
  MiMo-V2.5-Pro           0.3720 (Xiaomi) 
  DeepSeek V4 Pro (Max)   0.0560 (DeepSeek)
Could it be that it changed recently, or am I missing something? Both list prices are the same: https://openrouter.ai/compare/xiaomi/mimo-v2.5-pro/deepseek/...

EDIT: okay, I misread it. Does this mean that DeepSeek bills a higher percentage of input tokens at the cached price than MiMo does?

Correct. According to https://minimaxir.com/2026/05/openrouter-hy3/#llm-economics-... [0], when the models are served by DeepSeek, cache read cost is a very small percentage of input cost:

  DeepSeek V4 Pro    0.83%
  DeepSeek V4 Flash  2%
Note that OpenRouter response caching is not available when account-level ZDR is enforced [1].
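
To make the cache-hit point concrete, here is a rough sketch of how a blended input price could be computed. The formula is my assumption about what "effectiveInputPrice" means (the article's exact definition isn't quoted here), the function name is hypothetical, and the prices and hit rates below are made up for illustration, not real provider numbers:

```python
def effective_input_price(input_price, cache_read_price, cache_hit_rate):
    """Blended price per million input tokens (assumed formula).

    cache_hit_rate is the fraction of input tokens billed at the
    cache-read price; the remainder is billed at the full input price.
    """
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    return (cache_hit_rate * cache_read_price
            + (1.0 - cache_hit_rate) * input_price)


# Two hypothetical providers with identical list prices
# but different cache-hit rates (illustrative numbers only).
high_hit = effective_input_price(1.00, 0.10, cache_hit_rate=0.9)
low_hit = effective_input_price(1.00, 0.10, cache_hit_rate=0.5)

print(f"{high_hit:.2f}")  # 0.19
print(f"{low_hit:.2f}")   # 0.55
```

So two models with the same list prices can end up with very different effective input prices if one provider gets far more cache hits than the other.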

[0] https://news.ycombinator.com/item?id=48317294#48317823

[1] https://openrouter.ai/docs/guides/features/response-caching#...

How would you rate mimo against dsv4 pro? What do you work on?