Y
Hacker News
new
|
ask
|
show
|
jobs
by
int_19h
1142 days ago
When people tried 3-bit quantization for 7B models before, it did not exactly go well in terms of detrimental side effects. Are you using some new quantization techniques that mitigate that?