Hacker News
by electroglyph 7 days ago
any divergence (even if the benchmark is better) from full precision is error
1 comment

Just treat the divergence as if it were the next update step during training. You didn’t train your model to step=inf, I hope?
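To make the comparison concrete, here is a rough sketch that measures how far the weights move under quantization versus under one more training step. Everything in it is an assumption for illustration and not from the thread: a toy 16-weight linear model, symmetric 8-bit round-to-nearest quantization, a single full-batch gradient step with lr=0.01, and the helper names `quantize`, `l2`, and `sgd_step`. It says nothing about real models, it only shows how you could put the two shifts on the same scale.

```python
import math
import random


def quantize(weights, bits=8):
    """Symmetric round-to-nearest quantization onto a uniform grid, then dequantize."""
    levels = 2 ** (bits - 1) - 1
    scale = max(abs(w) for w in weights) / levels
    return [round(w / scale) * scale for w in weights]


def l2(v):
    """Euclidean norm of a list of floats."""
    return math.sqrt(sum(x * x for x in v))


def sgd_step(weights, xs, ys, lr=0.01):
    """One full-batch gradient step on mean squared error for a linear model."""
    n = len(xs)
    grad = [0.0] * len(weights)
    for x, y in zip(xs, ys):
        err = sum(w * xi for w, xi in zip(weights, x)) - y
        for j, xi in enumerate(x):
            grad[j] += 2.0 * err * xi / n
    return [w - lr * g for w, g in zip(weights, grad)]


# Toy data (hypothetical): a random linear target and random starting weights.
rng = random.Random(0)
dim = 16
true_w = [rng.gauss(0, 1) for _ in range(dim)]
xs = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(64)]
ys = [sum(w * xi for w, xi in zip(true_w, x)) for x in xs]
weights = [rng.gauss(0, 1) for _ in range(dim)]

# How far do the weights move in each case?
quant_delta = l2([q - w for q, w in zip(quantize(weights), weights)])
step_delta = l2([s - w for s, w in zip(sgd_step(weights, xs, ys), weights)])

print(f"quantization shift: {quant_delta:.4f}")
print(f"one training step:  {step_delta:.4f}")
```

Which of the two numbers is larger depends entirely on the bit width, the learning rate, and how far along training is, so treat the output as a way to frame the question and not as an answer.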