| HN Mirror



	by ricardobayes 20 days ago
	You can run it; however, those heavily quantized models (IQ2, IQ4, Q2) will very likely underperform the 9B versions at Q6/Q8.

1 comment

kanemcgrath 19 days ago

Something about Qwen models makes them hold up really well even at low quants. For most other models anything under q5 is cooked, but on 35B-A3B I can get a lot of things done even at q3_xl. It is definitely better than the full-precision 9B.
