by GartzenDeHaes 1180 days ago
It's interesting to me that the LLaMA-nB models still produce reasonable results after 4-bit quantization of the 32-bit weights. Does this indicate some possibility of reducing the compute required for training?
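
For anyone who wants to see what 4-bit quantization does to a set of weights, here is a minimal sketch. It is not the scheme used by any particular LLaMA tooling; it assumes plain symmetric round-to-nearest quantization with one scale per group of 32 weights, and the function names, the group size, and the random Gaussian "weights" are all made up for illustration.

```python
import random

def quantize_4bit(weights, group_size=32):
    """Symmetric round-to-nearest quantization to signed 4-bit codes (-8..7).

    Hypothetical helper: one float scale is kept per group of weights.
    """
    codes, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        # Map the largest magnitude in the group to code 7; fall back to 1.0
        # for an all-zero group so we never divide by zero.
        scale = max(abs(w) for w in group) / 7 or 1.0
        scales.append(scale)
        codes.extend(max(-8, min(7, round(w / scale))) for w in group)
    return codes, scales

def dequantize_4bit(codes, scales, group_size=32):
    """Rebuild approximate float weights from the codes and per-group scales."""
    return [c * scales[i // group_size] for i, c in enumerate(codes)]

# Stand-in data (an assumption): small Gaussian values, not real model weights.
random.seed(0)
weights = [random.gauss(0.0, 0.02) for _ in range(1024)]

codes, scales = quantize_4bit(weights)
restored = dequantize_4bit(codes, scales)

# Worst-case absolute rounding error across all weights.
max_err = max(abs(w - r) for w, r in zip(weights, restored))
print(f"groups: {len(scales)}, max abs error: {max_err:.6f}")
```

Each weight ends up at most half a quantization step away from its original value, which gives some intuition for why inference can tolerate it. Note that this only shows quantizing weights after training; it says nothing by itself about whether training could run at that precision.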