| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by Terretta 1 day ago

With Qwen3.6-35B-A3B-MTP-UD-Q4_K_XL.gguf?

Also, funny lumping the M4 "and" the M5, I find them 15% to 45% different performance, depending.

And for a good deal of work, an M3 Studio Ultra outpaces the M4 and ties the M5 on single work at a time, outpaces both doing multiple work at a time.

1 comments

nateb2022 14 hours ago

> With Qwen3.6-35B-A3B-MTP-UD-Q4_K_XL.gguf?

Ahh no I'm using the MLX version, it's about 5-10% faster than GGUFs in my experience.

link