Hacker News new | ask | show | jobs
by karmasimida 504 days ago
Why? Serving is still a massive effort, requires massive amount of GPU memory to hold those models.

I don't understand the logic that deepseek somehow is a blow to GPU demand. If anything, more people will try to build on top of R1 style model now, it is only going to drive demand, for customized training.

1 comments

We can buy old chips at any volume. The restriction is only on the latest and greatest.

DeepSeek has shown that you can achieve the same or better result on old hardware with less computing power.

H800 is essentially H100, and it is not old. And GPUs do expire, it breaks down constantly. You need to swap them in and out.

Buying old chips isn't related to deepseek what so ever, you can buy A100 also.