Hacker News new | ask | show | jobs
by gcp 3402 days ago
It's not so much reduced math performance (it's twice faster than Bulldozer!), it's that Intel chips as of Haswell have dual 256-bit FPUs per core.

If you use them, the Intel chips downclock (sometimes severely) in order not to violate their TDP, but the dual FPUs are still there, and it's still a win for GEMM. I can see why AMD didn't follow along here, but it could be a factor in some small spaces - when you need GEMM but can't use a GPU.

Note that Ryzen can split its 256-bit FPU into two 128-bit units, so on code that's not using AVX, it's completely on par with Intel.

1 comments

Wow, who uses 256-bit floats?
It doesn't handle 256-bit floats, in this context 256-bit means it can operate on eight 32-bit floats or four 64-bit floats at the same time.
Ah, that's what I thought, I was so surprised when i thought you said they had a 256 bit fpu XD

I'm dumb.

It's interesting that power consumption increases with avx instructions. Do you happen to have a link?