| HN Mirror



	by zozbot234 49 days ago
	It's not nerfed; it's natively trained at that quantization, a.k.a. quantization-aware training (QAT).

1 comment

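QAT typically keeps the weights in BF16/FP32 during training and simulates the lower precision in the forward pass ("fake quantization"), so the model learns to tolerate the rounding error.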
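
To make the "simulate lower precision" part concrete, here is a minimal sketch in plain Python. It is not any particular framework's implementation: the function names `fake_quantize` and `qat_step`, the symmetric per-tensor 4-bit grid, and the squared-error loss are all assumptions chosen for illustration. The gradient uses the common straight-through estimator, which treats the rounding step as the identity in the backward pass.

```python
def fake_quantize(weights, bits=4):
    """Snap float weights to a symmetric signed integer grid, then map back to floats.

    Hypothetical helper: per-tensor scale, symmetric range [-qmax, qmax].
    """
    qmax = 2 ** (bits - 1) - 1                  # 7 for 4-bit
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    out = []
    for w in weights:
        q = round(w / scale)                    # nearest integer level
        q = max(-qmax, min(qmax, q))            # clamp to the representable range
        out.append(q * scale)                   # dequantize: still a float
    return out


def qat_step(weights, x, target, lr=0.1, bits=4):
    """One SGD step on loss = (w_q . x - target)^2 for a single linear unit.

    The forward pass sees quantized weights; the update is applied to the
    full-precision weights (straight-through estimator: d w_q / d w := 1).
    """
    w_q = fake_quantize(weights, bits)
    pred = sum(wq * xi for wq, xi in zip(w_q, x))
    err = pred - target
    grads = [2.0 * err * xi for xi in x]        # gradient w.r.t. w_q, passed straight to w
    return [w - lr * g for w, g in zip(weights, grads)]


if __name__ == "__main__":
    w = [0.9, -0.4, 0.15]
    for _ in range(20):
        w = qat_step(w, x=[1.0, 2.0, -1.0], target=0.5)
    print("full-precision weights:", w)
    print("weights as deployed:   ", fake_quantize(w))
```

The point of the design is that the full-precision weights are what the optimizer updates, while the loss is always computed on the values the model would have after quantization.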