by cubefox 83 days ago
No, because the base model from which the distilled or quantized models are derived is larger.