Hacker News
by egman_ekki 922 days ago
I’ve been running Orca 2 13B on an M1 Pro with 32GB of RAM, using LM Studio with GPU acceleration, and it works quite nicely.

https://huggingface.co/TheBloke/Orca-2-13B-GGUF

2 comments

What kind of tokens/sec do you get?
Ollama runs 13B models just fine on my M2 Air with 16GB.