Hacker News new | ask | show | jobs
by ta988 475 days ago
You will have to explain to me how.
2 comments

Is that a CPU based inference build? Shouldn't you be able to get more performance out of the M3's GPU?
Inference is about memory bandwidth and some CPUs have just as much bandwidth as a GPU.