Hacker News new | ask | show | jobs
SqueezeLLM: Lossless 3-bit quantization with improved performance (arxiv.org)
4 points by vegarab 1104 days ago