Hacker News new | ask | show | jobs
by zozbot234 1 day ago
It has been quantized to 80GB (2-bit quantization for experts) with limited degradation. Certainly competitive with a 27B model, and especially useful in a size range where few "native" models exist.
1 comments

> (2-bit quantization for experts) with limited degradation. Certainly competitive with a 27B model

Uh-huh...