| HN Mirror



	by egman_ekki 922 days ago
	I’ve been running Orca 2 13B on an M1 Pro with 32GB of RAM, using LM Studio with GPU acceleration, and it works quite nicely. https://huggingface.co/TheBloke/Orca-2-13B-GGUF

2 comments

What kind of tokens/sec do you get?

Ollama runs 13B models just fine on my M2 Air with 16 GB.
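
For anyone wanting to answer the tokens/sec question with Ollama, here is a rough sketch. It assumes the response stats from Ollama's generate endpoint include `eval_count` (tokens generated) and `eval_duration` (in nanoseconds), which is my understanding of its API docs. The helper name and the sample numbers are made up for illustration and are not a measurement from either machine in this thread.

```python
def tokens_per_second(eval_count: int, eval_duration_ns: int) -> float:
    """Generation speed from Ollama-style stats (duration assumed to be in nanoseconds)."""
    if eval_duration_ns <= 0:
        raise ValueError("eval_duration_ns must be positive")
    # Convert nanoseconds to seconds, then divide tokens by seconds.
    return eval_count / (eval_duration_ns / 1e9)


# Hypothetical stats for illustration only: 200 tokens generated in 10 seconds.
example_stats = {"eval_count": 200, "eval_duration": 10_000_000_000}
print(tokens_per_second(example_stats["eval_count"], example_stats["eval_duration"]))
```

Only the generation phase is counted here; prompt processing time is reported separately and would need its own calculation.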