|
|
|
|
|
by janwas
239 days ago
|
|
Wow, that number requires STRONG caveats, lest it be called out as completely false.
Take away the tensor cores (unless you only do matmuls?), and an H100 has roughly 2x as many f32 flops as a Zen5 CPU, which is considerably cheaper. I suspect brute force HW/algorithms are not going to age well: https://www.sigarch.org/dont-put-all-your-tensors-in-one-bas...
(/personal opinion) |
|