by
simjnd
46 days ago
I don't think any models are natively INT4? I don't see the point of nerfing a model out of the box.
2 comments
zozbot234
46 days ago
It's not nerfed, it's natively trained at that quantization, a.k.a. quantization-aware training (QAT).
pbgcp2026
46 days ago
QAT typically uses BF16/FP32 during the training process to simulate lower precision.
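To make "simulate" concrete, here is a minimal sketch of the fake-quantization step as I understand it: weights stay in floating point, but the forward pass snaps them to the nearest INT4 level and converts straight back. The function name `fake_quantize_int4`, the symmetric per-tensor scale, and the sample weights are all assumptions for illustration, not any particular framework's API. Real QAT setups also need a way to pass gradients through the rounding (commonly a straight-through estimator), which this sketch leaves out.

```python
def fake_quantize_int4(weights):
    """Quantize floats to symmetric INT4 and immediately dequantize.

    The result is still a list of floats, but each value sits on one of
    at most 16 levels, which is what the forward pass sees during QAT.
    """
    qmin, qmax = -8, 7  # signed 4-bit integer range
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0.0:
        return list(weights)  # nothing to scale
    scale = max_abs / qmax  # assumed: one scale for the whole tensor
    simulated = []
    for w in weights:
        q = round(w / scale)          # snap to the nearest integer level
        q = max(qmin, min(qmax, q))   # clamp into the INT4 range
        simulated.append(q * scale)   # back to float for the rest of the pass
    return simulated


# Hypothetical high-precision weights
weights = [3.5, -1.3, 0.6, 0.0]
simulated = fake_quantize_int4(weights)
print(simulated)  # [3.5, -1.5, 0.5, 0.0]
```

The master weights keep their full precision and are what the optimizer updates; only the values used in the forward pass are rounded.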
EntityDeletr
45 days ago
The only model I have seen like that is GPT OSS, natively quantized to MXFP4.