by Terretta
1 day ago
With Qwen3.6-35B-A3B-MTP-UD-Q4_K_XL.gguf? Also, funny to lump the M4 "and" the M5 together; I find their performance differs by 15% to 45%, depending on the workload. And for a good deal of work, an M3 Studio Ultra outpaces the M4 and ties the M5 when running one job at a time, and outpaces both when running several jobs at once.

Ahh no, I'm using the MLX version. It's about 5-10% faster than GGUFs in my experience.