Hacker News new | ask | show | jobs
by bmelton 477 days ago
https://digitalspaceport.com/how-to-run-deepseek-r1-671b-ful...
1 comments

Is that a CPU based inference build? Shouldn't you be able to get more performance out of the M3's GPU?
Inference is about memory bandwidth and some CPUs have just as much bandwidth as a GPU.