Hacker News
by
ricardobayes
20 days ago
You can run it; however, those low-quant models (iQ2, iQ4, Q2) will very likely underperform the 9B versions at Q6/Q8.
1 comments
kanemcgrath
19 days ago
Something about Qwen models makes them hold up really well even at low quants. For most other models anything under Q5 is cooked, but on 35B-A3B I can get a lot of things done even at q3_xl. It is definitely better than full-precision 9B.
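For a rough sense of why this comparison comes up at all, here is a back-of-the-envelope sketch of weight size as parameters times bits per weight. The bits-per-weight figures are illustrative assumptions, not exact values for any particular quant format; "full precision" is assumed to mean 16-bit, and the estimate ignores KV cache and runtime overhead. It says nothing about output quality, only about footprint.

```python
def approx_weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Rough size of the model weights alone, in GiB (no KV cache, no overhead)."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


# Bits-per-weight values below are assumptions chosen for illustration.
configs = [
    ("35B at ~3.5 bpw (Q3-class, assumed)", 35, 3.5),
    ("9B at ~8.5 bpw (Q8-class, assumed)", 9, 8.5),
    ("9B at 16 bpw (assumed FP16 'full precision')", 9, 16),
]

for label, params_billion, bpw in configs:
    print(f"{label}: {approx_weight_gib(params_billion, bpw):.1f} GiB")
```

Under these assumptions the Q3-class 35B lands between the Q8-class 9B and the 16-bit 9B in weight size, so it competes for roughly the same hardware.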