Y
Hacker News
new
|
ask
|
show
|
jobs
by
bcatanzaro
178 days ago
The Nano model isn’t pretrained in FP4, only Super and Ultra are. And posttraining is not in FP4, so the posttrained weights of these models are not native FP4.