|
|
|
|
|
by Const-me
590 days ago
|
|
One interesting thing about these newfangled matrix/AI/ML accelerators that’s very rarely mentioned on the internets, they only deliver that many TFLOP because they operate in very low precision. nVidia tensor cores support int8, couple versions of FP16 (BF16 and the standard IEEE one) and FP19 which they call TensorFloat-32. I think Intel AMX only supports int8 and BF16. None of them supports FP32 let alone FP64 input numbers, which makes them completely useless for traditional GEMM stuff. |
|