|
|
|
|
|
by groby_b
501 days ago
|
|
I think it's worth carefully pulling apart _what_ DeepSeek is cheaper at. It's somewhat cheaper at inference (0.3 OOM), and about 1-1.5 OOM cheaper for training (Inference costs: https://www.latent.space/p/reasoning-price-war) It's also worth keeping in mind that depending on benchmark, these values change (and can shrink quite a bit) And it's also worth keeping in mind that the drastic drop in training cost(if reproducible) will mean that training is suddenly affordable for a much larger number of organizations. I'm not sure the impact on GPU demand will be as big as people assume. |
|