Hacker News
by bcatanzaro 178 days ago
The Nano model isn’t pretrained in FP4; only Super and Ultra are. Post-training is not done in FP4 either, so the post-trained weights of these models are not native FP4.