by ilia-a 12 days ago
Looks like just a rebranded DGX in laptop form. The biggest miss is the weak memory speed: 1/2 of the M5 laptop's memory speed, and 1/3 of the M3 Ultra, which is now years old...
2 comments

I'm not sure that's such a bad thing. It's not going to challenge the Apple M5, but if you're specifically looking for something in the "not-Mac" market, having a laptop-sized version of the DGX is probably going to be pretty successful.
Then they better release it within the next two weeks.
What's coming out in two weeks?
The main bus is 300 GB/s, which is on par with the MB Pro. The MB Max has 600 GB/s of unified memory (about 500 in practice for token generation), but only in the 40-core variant, which is $7k+, ironically more expensive than a dual-3090 desktop. The 32-core variant, which is still wildly expensive, is about 400 GB/s.
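
As a rough back-of-envelope for why bandwidth matters here: token generation is mostly memory-bound, so an upper bound on tokens/second is roughly bandwidth divided by the bytes read per token (about the size of the weights for a dense model). A minimal sketch, assuming a hypothetical 40 GB model; the function name and model size are mine, the bandwidth figures are the ones quoted above:

```python
def max_tokens_per_sec(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Upper bound on decode speed when every token reads all weights once."""
    return bandwidth_gb_s / model_size_gb

MODEL_GB = 40.0  # assumption: hypothetical dense model with ~40 GB of weights

# Bandwidth figures taken from the comment above, in GB/s.
for label, bw in [("300 GB/s", 300.0), ("~400 GB/s", 400.0), ("~500 GB/s (600 nominal)", 500.0)]:
    print(f"{label}: at most {max_tokens_per_sec(bw, MODEL_GB):.1f} tok/s")
```

Real numbers land below this ceiling, but the ratio between machines should hold roughly.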

The biggest area where this will crush Apple is the initial prefill phase: 6000+ cores vs 32/40, plus active cooling with fans. For local LLMs, this matters quite a bit more than tokens/second.
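
To put the prefill point in rough terms: prefill is compute-bound, and a common approximation is about 2 FLOPs per parameter per prompt token for a dense model. A sketch only; the parameter count, prompt length, and both throughput figures are made-up placeholders, not specs for either machine:

```python
def prefill_seconds(params: float, prompt_tokens: int, flops_per_sec: float) -> float:
    """Approximate prompt-processing time: ~2 FLOPs per parameter per token."""
    return 2.0 * params * prompt_tokens / flops_per_sec

PARAMS = 70e9         # assumption: hypothetical 70B dense model
PROMPT_TOKENS = 8000  # assumption: hypothetical prompt length

# Placeholder sustained throughputs, NOT measured figures for any real device.
for label, flops in [("device A", 50e12), ("device B", 200e12)]:
    secs = prefill_seconds(PARAMS, PROMPT_TOKENS, flops)
    print(f"{label}: ~{secs:.1f} s before the first token")
```

The time to first token scales inversely with compute, which is why the core count matters more here than memory bandwidth.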

In the end, neither is really worth it for LLM use compared to just building a desktop and port forwarding over SSH to Ollama.
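
For anyone who hasn't set that up, a minimal sketch of the tunnel, assuming Ollama is listening on its default port 11434 on the desktop. The helper name and `user@desktop` are hypothetical; substitute your own host:

```python
import shlex

OLLAMA_PORT = 11434  # Ollama's default listen port

def ssh_forward_command(host: str,
                        local_port: int = OLLAMA_PORT,
                        remote_port: int = OLLAMA_PORT) -> list[str]:
    """Build an `ssh -N -L` command tunnelling local_port to Ollama on host."""
    # -N: run no remote command, just forward; -L: local port forwarding.
    return ["ssh", "-N", "-L", f"{local_port}:localhost:{remote_port}", host]

# "user@desktop" is a placeholder host, not a real machine.
print(shlex.join(ssh_forward_command("user@desktop")))
```

With the tunnel up, clients on the laptop can talk to `http://localhost:11434` as if Ollama were running locally.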

Because of memory costs lately, I doubt this will be much cheaper. Also, this is quite a bit slower than even a 4070, let alone the *90 Nvidia variants, though those cards have much less memory.