Hacker News new | ask | show | jobs
by dlcarrier 35 days ago
What about for 16 GB VRAM? Is Qwen3.5-9B worthwhile?
1 comments

For 16 GB I would look into running Qwen3.6-35B-A3B (MoE) with some layers offloaded to CPU.