Hacker News new | ask | show | jobs
by curioussquirrel 46 days ago
V4 is definitely a step-up from V3.2 on our multilingual benchmarks.

Two caveats: - when inferring through Openrouter, we've had a lot of issues with very slow speeds (TPS) and an occasional instability. I just checked and it's still 10-30 TPS on all available providers, which is not a lot for a model that likes to think as much as DeepSeek does.

- the official DeepSeek API makes no guarantees of data privacy even for paying users.

Both points could be moot with using it through Azure AI foundry (the latter is, afaik); I have yet to test that.

In any case, happy to see more open-weights models that are somewhat competitive with SOTA models!