| HN Mirror



	by __loam 966 days ago
	Aren't these things supposed to have cores dedicated to ml?

2 comments

You’re thinking of the Neural Engine. I’m not sure llama.cpp makes use of it. To do so, they’d have to convert the model to a Core ML model.

The Neural Engine cores are not as fast as the GPU, but they draw much less power.

Also, not many implementations can even use the Neural Engine.