Hacker News new | ask | show | jobs
by nateb2022 1 day ago
> With Qwen3.6-35B-A3B-MTP-UD-Q4_K_XL.gguf?

Ahh no I'm using the MLX version, it's about 5-10% faster than GGUFs in my experience.