Y
Hacker News
new
|
ask
|
show
|
jobs
by
cbo100
507 days ago
I get the right answer on the 8B model too.
It could be the quantized version failing?
1 comments
ein0p
506 days ago
My models are both 4 bit. But yeah, that could be - small models are much worse at tolerating quantization. That's why people use LoRA to recover the accuracy somewhat even if they don't need domain adaptation.
link