by sebastianconcpt 4 days ago
Better than Qwen3.6-35B-A3B-8bit?

When I tried GLM, I found it way, way slower (with omlx as the runtime).

1 comment

Yes, way better. We host both, and while Qwen3.6 runs at over 100 tps, we can usually get GLM to around that too.