by wren6991 36 days ago
For 16 GB I would look into running Qwen3.6-35B-A3B (MoE) with some layers offloaded to CPU.
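
Rough back-of-envelope for why offloading comes up at all, as a sketch only. The 35B total parameter count is read off the model name; everything else is my assumption: that the 16 GB is GPU memory, a quantization of about 4.5 bits per weight, and 2 GB held back for KV cache and activations. The function names are made up for illustration and don't correspond to any tool's API.

```python
def model_size_gb(n_params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (1 GB = 10^9 bytes)."""
    return n_params_billion * bits_per_weight / 8


def fraction_on_gpu(total_gb: float, vram_gb: float, reserve_gb: float) -> float:
    """Share of the weights that fits after reserving room for cache/activations."""
    usable = max(vram_gb - reserve_gb, 0.0)
    return min(usable / total_gb, 1.0)


# Assumed values: 4.5 bits/weight quantization, 2 GB reserve (not from the comment).
total = model_size_gb(35, 4.5)
frac = fraction_on_gpu(total, vram_gb=16, reserve_gb=2)

print(f"weights: ~{total:.1f} GB, fits on GPU: ~{frac:.0%}")
```

Under those assumptions the weights alone come out larger than 16 GB, so part of the model has to live in system RAM. Since it's MoE with only about 3B parameters active per token (the "A3B" part), the offloaded portion should hurt less than it would for a dense model of the same size.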