by more_corn 1841 days ago
"These FP16 operations are very useful for some kinds of calculations."

ML, for example. If you use 16-bit precision you can fit the model in half the memory, and memory-bound lookups can be up to twice as fast, since each value takes half as many bytes to move. Newer GPU models offer a "mixed precision mode". It takes some doing to get that working in your tooling, though.
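To put a number on the "half the memory" point, here is a minimal sketch using only Python's standard library. The `struct` format code `e` is IEEE 754 half precision and `f` is single precision. The `weights` list is a made-up stand-in for model parameters, and the helper names are hypothetical, not from any ML framework.

```python
import struct

def packed_size(values, fmt):
    """Return how many bytes it takes to store `values` in struct format `fmt`."""
    return len(struct.pack(f"<{len(values)}{fmt}", *values))

def round_trip_fp16(x):
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]

# Hypothetical stand-in for model weights (all well inside the FP16 range).
weights = [0.1 * i for i in range(1000)]

fp32_bytes = packed_size(weights, "f")  # 4 bytes per value
fp16_bytes = packed_size(weights, "e")  # 2 bytes per value

print(f"FP32: {fp32_bytes} bytes, FP16: {fp16_bytes} bytes")
# The cost of the smaller format: 0.1 is no longer stored as accurately.
print(f"0.1 stored as FP16 reads back as {round_trip_fp16(0.1)!r}")
```

The storage halves exactly, but each value keeps only about three significant decimal digits, which is why the tooling usually mixes precisions rather than switching everything to FP16.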

1 comment

Less than it used to! NVIDIA's Apex now offers automatic mixed precision for cuBLAS operations, so you get a good share of the benefit even without a significant code change.
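For anyone wondering what the "automatic" part has to take care of: one standard ingredient of mixed precision training is loss scaling, because small gradients underflow to zero in FP16. The sketch below simulates that with the standard library only. It is not Apex's code or API; `to_fp16`, `LOSS_SCALE`, and the gradient value are assumptions chosen for illustration.

```python
import struct

def to_fp16(x):
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]

# Hypothetical tiny gradient, below the smallest value FP16 can represent.
grad = 1e-8

# Assumed fixed scale factor; real tools typically adjust this dynamically.
LOSS_SCALE = 65536.0

# Stored directly in FP16, the gradient underflows and the update is lost.
naive = to_fp16(grad)

# Scaling first moves the value into FP16's representable range...
scaled = to_fp16(grad * LOSS_SCALE)

# ...and unscaling in higher precision recovers it for the weight update.
recovered = scaled / LOSS_SCALE

print(f"naive FP16 gradient: {naive!r}")
print(f"recovered gradient:  {recovered!r}")
```

The design idea is that the FP16 math gets the speed and memory win while the scale and unscale steps happen in higher precision, which is the bookkeeping you would otherwise have to wire into your training loop by hand.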