Y
Hacker News
new
|
ask
|
show
|
jobs
by
bt1a
518 days ago
How excellent for a quantized 27GB model (the Q6_K_L GGUF quantization type uses 8 bits per weight in the embedding and output layers since they're sensitize to quantization)