Hacker News
by __loam 966 days ago
Aren't these things supposed to have cores dedicated to ML?
2 comments

You’re thinking of the Neural Engine. I’m not sure that llama.cpp makes use of it. They’d have to convert the model to Core ML to do so.
The Neural Engine cores are not as fast as the GPU (but draw much less power).

Also, not many implementations can even use it.