|
|
|
|
|
by goosehonk
2393 days ago
|
|
It's a good thing you measured it :-) Programs that do a little bit of 512x512 FMA mixed in with other stuff will not benefit from AVX-512 but can suffer from the heat it generates, or from the hiccup when the CPU turns the FMA unit on and back off. Codes that can do a lot of 512b FMA consecutively will benefit very greatly, and pay a small penalty (up to 25%) in terms of throughput for everything else. Codes that use non-multiplier stuff that's just marketed as AVX-512, like VBMI2, also benefit greatly and without any penalty. People with AMD CPUs don't get a choice. Hard to see how this accrues to Intel's mistakes column. |
|
AMD isn't relevant in this space AFAIK.