by dist-epoch
65 days ago
For those wondering where this is practically relevant: it is the basic metric used to compare quantizations of LLMs. You measure the KL divergence of the output token distribution of a 4-bit quantization against the original 16-bit model, do the same for an 8-bit one, and compare the two numbers.
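A rough sketch of what that comparison looks like for a single next-token prediction. The logits below are made-up numbers, not output from any real model, and the names (`fp16_logits`, `q8_logits`, `q4_logits`, `kl_divergence`) are only illustrative. In practice you would presumably average this over many token positions of an evaluation text.

```python
import math

def softmax(logits):
    """Turn raw logits into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q, eps=1e-12):
    """KL(P || Q) in nats; P is the reference, Q the approximation."""
    return sum(pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0)

# Hypothetical logits over a 4-token vocabulary for the same prompt.
fp16_logits = [2.0, 1.0, 0.1, -1.0]    # original 16-bit model (reference)
q8_logits = [1.98, 1.02, 0.12, -0.97]  # assumed 8-bit quantization: small drift
q4_logits = [1.7, 1.3, 0.4, -0.6]      # assumed 4-bit quantization: larger drift

p_ref = softmax(fp16_logits)
kl_q8 = kl_divergence(p_ref, softmax(q8_logits))
kl_q4 = kl_divergence(p_ref, softmax(q4_logits))

print(f"KL(fp16 || 8-bit) = {kl_q8:.6f} nats")
print(f"KL(fp16 || 4-bit) = {kl_q4:.6f} nats")
```

A lower value means the quantized model's predictions stay closer to the original's. Note the direction: the 16-bit model is the reference distribution P, since KL divergence is not symmetric.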