Y
Hacker News
new
|
ask
|
show
|
jobs
by
behnamoh
217 days ago
the only good-enough model I still use it gpt-oss-120b-mxfp4 (not 20b) and glm-4.6 at q8 (not q4).
quantization ruins models and some models aren't that smart to begin with.