| HN Mirror



	by more_corn 1889 days ago
	"These FP16 operations are very useful for some kinds of calculations." ML, for example. If you use 16-bit precision, you can fit the model in half the memory, and memory reads move twice as many values for the same bandwidth. Newer GPU models offer a "mixed precision mode". It takes some doing to get that working in your tooling, though.

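To put rough numbers on the comment above, here is a minimal sketch using only Python's standard `struct` module, whose `e` format is IEEE 754 half precision. The weights are made up, the helper names (`to_fp16`, `packed_size`, `sum_fp16`, `sum_mixed`) are hypothetical, and the "mixed" accumulator is a plain Python float (64-bit) standing in for the FP32 accumulator a GPU would typically use. It only illustrates the storage saving and why accumulating in a wider type matters; it says nothing about how any particular GPU or library implements mixed precision.

```python
import struct


def to_fp16(x: float) -> float:
    """Round x to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def packed_size(values, fmt_char: str) -> int:
    """Bytes needed to store `values` with the given struct format character."""
    return len(struct.pack(f"<{len(values)}{fmt_char}", *values))


def sum_fp16(values) -> float:
    """Accumulate entirely in FP16: the running total is rounded after each add."""
    total = 0.0
    for v in values:
        total = to_fp16(total + to_fp16(v))
    return total


def sum_mixed(values) -> float:
    """Store inputs as FP16 but accumulate in a wider float (stand-in for FP32)."""
    total = 0.0
    for v in values:
        total += to_fp16(v)
    return total


# Made-up "model weights", all well inside the FP16 range.
weights = [0.001 * i for i in range(1000)]
size_fp32 = packed_size(weights, "f")  # 4 bytes per value
size_fp16 = packed_size(weights, "e")  # 2 bytes per value
print(size_fp32, size_fp16)

# FP16 cannot represent 2049, so a pure-FP16 running sum of ones stalls at 2048.
ones = [1.0] * 4096
print(sum_fp16(ones), sum_mixed(ones))
```

The second pair of numbers is the reason it is "mixed" precision rather than plain FP16: the values are stored in 16 bits, but the sums are kept in a wider type so that small contributions are not rounded away.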
1 comment

freeone3000 1889 days ago

Less than it used to! NVIDIA's Apex now offers automatic mixed precision on cuBLAS operations, so you get a good share of the benefit even without a significant code change.
