Hacker News new | ask | show | jobs
by zozbot234 49 days ago
It's not nerfed, it's natively trained at that quantization a.k.a. Quantization Aware Training.
1 comments

QAT typically uses BF16/FP32 during the training process to simulate lower precision.