Hacker News
by Firadeoclus 1254 days ago
Note that AMX can achieve roughly double the FLOPS with FP16, and 8 TFLOPS for the GPU is only about 77% of peak. You can do better than that: especially with FP16, 90+% of peak is possible (which is >9.4 TFLOPS).
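For anyone who wants to check the arithmetic, here is a rough sketch in Python. It assumes the only inputs are the two figures quoted above (8 TFLOPS achieved, about 77% utilization); the function names are made up for illustration. Because "about 77%" is rounded, the results are approximate: the implied peak comes out near 10.4 TFLOPS and 90% of that lands near 9.35 TFLOPS, in the neighborhood of the 9.4 quoted.

```python
def implied_peak(achieved_tflops: float, utilization: float) -> float:
    """Peak throughput implied by an achieved figure and a utilization fraction."""
    return achieved_tflops / utilization


def achieved_at(peak_tflops: float, utilization: float) -> float:
    """Throughput reached when running at a given fraction of peak."""
    return peak_tflops * utilization


# Figures taken from the comment: 8 TFLOPS is "about 77%" of peak.
peak = implied_peak(8.0, 0.77)

# What 90% utilization of that implied peak would deliver.
at_90 = achieved_at(peak, 0.90)

print(f"implied peak:    {peak:.2f} TFLOPS")
print(f"at 90% of peak:  {at_90:.2f} TFLOPS")
```

The small gap between this result and the ">9.4" figure is plausibly just rounding in the 77% number, so treat both as ballpark values.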