Hacker News new | ask | show | jobs
by mekpro 566 days ago
As a quick estimation, the size of q4 quantized model usually be around 60-70% of the model's parameter. You can preciselly check the quantized model size from .gguf files hosted in huggingface.