| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by stonogo 197 days ago
	Am I reading this wrong, or does this only support FP16 inputs, and compares its performance against an FP32 solver?

1 comments

They compare HGEMM implementations. At least CUBLAS has HGEMM functions.

HGEMM means half-precision (i.e. FP16) general matrix multiplication