|
|
|
|
|
by curioussquirrel
46 days ago
|
|
V4 is definitely a step-up from V3.2 on our multilingual benchmarks. Two caveats:
- when inferring through Openrouter, we've had a lot of issues with very slow speeds (TPS) and an occasional instability. I just checked and it's still 10-30 TPS on all available providers, which is not a lot for a model that likes to think as much as DeepSeek does. - the official DeepSeek API makes no guarantees of data privacy even for paying users. Both points could be moot with using it through Azure AI foundry (the latter is, afaik); I have yet to test that. In any case, happy to see more open-weights models that are somewhat competitive with SOTA models! |
|