| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ein0p 851 days ago
	You don’t need MLX for this. Ollama, which is based on llama.cpp is GPU accelerated on a Mac. In particular it has better performance on quantized models. MLX can be used for eg fine tuning etc. It’s a bit faster than PyTorch for that.