Hacker News new | ask | show | jobs
by areddyyt 642 days ago
We do quantization-aware training, so the model should minimize the loss w.r.t. the ternary weights, hence no degradation in performance.