Hacker News
by GartzenDeHaes 1180 days ago
It's interesting to me that the LLaMA-nB models still produce reasonable results after 4-bit quantization of the 32-bit weights. Does this indicate some possibility of reducing the compute required for training?
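For anyone unfamiliar with what 4-bit quantization of weights means in practice, here is a rough sketch. It assumes a simple symmetric round-to-nearest scheme with one scale per group of 32 weights; the function names, the group size, and the random test weights are all made up for illustration, and this is not necessarily the scheme any particular LLaMA port uses. Note that it compresses weights that are already trained, so by itself it says nothing about training cost.

```python
import random

def quantize_4bit(weights, group_size=32):
    """Map floats to integer codes in [-7, 7], with one scale per group (hypothetical scheme)."""
    codes, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        absmax = max(abs(w) for w in group)
        # The largest magnitude in the group maps to +/-7; avoid dividing by zero.
        scale = absmax / 7 if absmax > 0 else 1.0
        scales.append(scale)
        codes.extend(max(-7, min(7, round(w / scale))) for w in group)
    return codes, scales

def dequantize_4bit(codes, scales, group_size=32):
    """Recover approximate floats from the integer codes and per-group scales."""
    return [c * scales[i // group_size] for i, c in enumerate(codes)]

# Illustrative stand-in for a weight tensor: small Gaussian values.
random.seed(0)
weights = [random.gauss(0.0, 0.02) for _ in range(1024)]

codes, scales = quantize_4bit(weights)
restored = dequantize_4bit(codes, scales)
max_err = max(abs(w - r) for w, r in zip(weights, restored))
print(f"max abs error: {max_err:.5f}")
```

Each restored weight differs from the original by at most half of its group's scale, which is why small, tightly clustered weights can survive this kind of rounding reasonably well.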