Y
Hacker News
new
|
ask
|
show
|
jobs
by
stonogo
197 days ago
Am I reading this wrong, or does this only support FP16 inputs, and compares its performance against an FP32 solver?
1 comments
Bulat_Ziganshin
196 days ago
They compare HGEMM implementations. At least CUBLAS has HGEMM functions.
HGEMM means half-precision (i.e. FP16) general matrix multiplication
link
HGEMM means half-precision (i.e. FP16) general matrix multiplication