Y
Hacker News
new
|
ask
|
show
|
jobs
by
egman_ekki
922 days ago
I’ve been running Orca 2 13B on M1 Pro with 32GB of RAM with LLM Studio and GPU acceleration quite nicely.
https://huggingface.co/TheBloke/Orca-2-13B-GGUF
2 comments
evnc
922 days ago
What kind of tokens/sec do you get?
link
nicbou
922 days ago
Ollama runs 13b models just fine on my M2 Air with 16 GB
link