Hacker News new | ask | show | jobs
by int_19h 1142 days ago
When people tried 3-bit quantization for 7B models before, it did not exactly go well in terms of detrimental side effects. Are you using some new quantization techniques that mitigate that?