Y
Hacker News
new
|
ask
|
show
|
jobs
by
sebastianconcpt
4 days ago
Better than Qwen3.6-35B-A3B-8bit ?
When I tried glm found it way way slower (omlx as runtime)
1 comments
scottcha
3 days ago
Yes way better. We host both and while qwen3.6 is over 100tps we usually can do glm around that too.
link