Y
Hacker News
new
|
ask
|
show
|
jobs
by
tarruda
19 days ago
The official Q4_K_S gguf is quite good and has very good 35 tps generation on a M1 mac studio. Should be much faster on recent Macs, especially M5.
1 comments
SilverElfin
18 days ago
What’s “Q4_K_S gguf” and where do I get it? Is it easy to install and configure on a MacBook?
link
throw1234567891
18 days ago
https://huggingface.co/stepfun-ai/Step-3.7-Flash-GGUF
and you can use ollama:
https://docs.ollama.com/import#Importing-a-GGUF-based-model-...
link