|
|
|
|
|
by reach-vb
342 days ago
|
|
Hey Simon, VB from Hugging Face here and the person who added the model to MLX and llama.cpp (with Son). The PR hasn't yet landed on llama.cpp, hence it doesn't work OTB on llama.cpp installed via brew (similarly doesn't work with ollama since they need to bump their llama.cpp runtime) The easiest would be to install llama.cpp from source: https://github.com/ggml-org/llama.cpp If you want to avoid it, I added SmolLM3 to MLX-LM as well: You can run it via `mlx_lm.chat --model "mlx-community/SmolLM3-3B-bf16"` (requires the latest mlx-lm to be installed) here's the MLX-lm PR if you're interested: https://github.com/ml-explore/mlx-lm/pull/272 similarly, llama.cpp here: https://github.com/ggml-org/llama.cpp/pull/14581 Let me know if you face any issues! |
|