by mekpro 566 days ago
As a quick estimate, the size of a q4-quantized model in GB is usually around 60-70% of the model's parameter count in billions, so a 7B model comes out to roughly 4.2-4.9 GB. You can check the exact quantized model size from the .gguf files hosted on Hugging Face.
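If it helps, here is that rule of thumb as a few lines of Python. This is only a sketch of the 60-70% estimate: the function name `estimate_q4_size_gb` and the reading of the ratio as GB per billion parameters are my own assumptions, and the real size depends on the specific q4 variant, so the .gguf file listing is still the number to trust.

```python
def estimate_q4_size_gb(params_billions, low=0.60, high=0.70):
    """Rough (low, high) file size in GB for a q4-quantized model.

    Assumes the rule of thumb: size in GB is about 60-70% of the
    parameter count in billions. Hypothetical helper, not a library API.
    """
    if params_billions <= 0:
        raise ValueError("params_billions must be positive")
    return params_billions * low, params_billions * high


if __name__ == "__main__":
    # Common model sizes, in billions of parameters.
    for n in (7, 13, 70):
        lo, hi = estimate_q4_size_gb(n)
        print(f"{n}B params -> roughly {lo:.1f}-{hi:.1f} GB at q4")
```

Add some headroom on top of this for the context/KV cache when deciding whether a model fits in RAM or VRAM.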