| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by Jlagreen 506 days ago

Ah, I'm sorry you're right a misunderstood your comment.

I agree and Nvidia positions itself for exactly that. See how fast DeepSeek will come to NIM. People are already wondering how well DeepSeek will run on Digits.

Nvidia also offers distilled open models or specific own open models so is indirectly competing in that space as well. But Nvidia isn't in the LLM generator business but in the business of "infrastructure for LLM generators"

Everyone is waiting for GPT5 or another big bang. And because it takes so much people start to think that there is a wall. And there is a wall but that wall could be also compute. Blackwell will show if there is a compute wall because simply put, if a training run with large parameter set on GPT5 takes like 4 months then Blackwell might reduce that to under 1 month with the same amount of GPUs. Getting more GPUs can get that down even more. Imagine the speed up in AI frontier model research if your training times come down 4-5x from new GPU generation and another 2x from getting twice as many.

The nice part with Nvidia is also that the old GPUs don't become obsolete, OpenAI can continue using them for inferencing or even try to use combined architecture training as long as they don't go FP4.

I wouldn't be surprised that at the end of 2025 we will see things which will make DeepSeek and GPT4 as oldschool stuff simply because of the massive compute which Blackwell will deliver this year.