Hacker News new | ask | show | jobs
by muricula 476 days ago
Is that a CPU based inference build? Shouldn't you be able to get more performance out of the M3's GPU?
1 comments

Inference is about memory bandwidth and some CPUs have just as much bandwidth as a GPU.