Hacker News new | ask | show | jobs
by eli 228 days ago
Should be a bit faster if you run an MLX version of the model with LM Studio instead. Ollama doesn't support MLX.

Qwen3-Coder is in the same ballpark and maybe a bit better at coding

1 comments

LM Studio will run dynamic quants from Unsloth too. Much nicer than Ollama.