by 37ef_ced3 1998 days ago
Well, a GPU is a cluster of SIMD units with fast memory

A per-thread variable on a GPU is just like a SIMD lane, and that same variable taken across a whole warp is just like a SIMD vector register
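To make the lane/register analogy concrete, here is a minimal sketch in plain Python (no real GPU or SIMD hardware involved). The function names `saxpy_thread`, `saxpy_warp` and `saxpy_vector` are made up for illustration. The only hardware facts assumed are that an Nvidia warp is 32 threads wide and that a 512-bit register holds 16 32-bit floats.

```python
WARP_SIZE = 32            # threads per warp on Nvidia GPUs
AVX512_F32_LANES = 512 // 32  # 16 single-precision lanes per 512-bit register

def saxpy_thread(lane, a, x, y):
    """GPU view: each thread works on scalars selected by its lane index."""
    return a * x[lane] + y[lane]

def saxpy_warp(a, x, y):
    """The warp runs the same per-thread code once for every lane."""
    return [saxpy_thread(lane, a, x, y) for lane in range(WARP_SIZE)]

def saxpy_vector(a, x, y):
    """SIMD view: one operation applied to whole vector registers."""
    return [a * xi + yi for xi, yi in zip(x, y)]

x = [float(i) for i in range(WARP_SIZE)]
y = [1.0] * WARP_SIZE

# Both views compute the same thing; only the programming model differs.
assert saxpy_warp(2.0, x, y) == saxpy_vector(2.0, x, y)

# One 32-wide warp variable corresponds to this many 512-bit registers of floats.
registers_per_warp = WARP_SIZE // AVX512_F32_LANES
```

The GPU version is written from the point of view of one lane, the SIMD version from the point of view of the whole register, but the data layout is the same.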

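Nvidia's virtual instruction set is called PTX (the driver compiles it down to the native machine code, SASS), and its operations are similar in spirit to AVX-512, although PTX is written per thread rather than per vector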

An AVX-512 core is like a general purpose CPU with a 512-bit GPU core built in

So paying for a single AVX-512 core is like paying for part of a GPU, plus the general purpose compute you need to keep the GPU supplied with work

If you could divide the GPU up, you would lose most of the parallelism, keep all of the communication latency, and still need the drivers etc.
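A toy model of that argument, with every number invented purely for illustration (these are not measurements of any real GPU): run time is a fixed communication latency plus the work divided across the available compute units. The function `run_time_us` and its parameters are hypothetical.

```python
def run_time_us(work_items, units, latency_us, items_per_unit_per_us):
    """Fixed communication latency plus work spread evenly over `units`."""
    return latency_us + work_items / (units * items_per_unit_per_us)

WORK = 1_000_000     # hypothetical number of work items
LATENCY_US = 10.0    # hypothetical round-trip latency, same for any slice size
RATE = 100.0         # hypothetical items per unit per microsecond

whole_gpu = run_time_us(WORK, units=80, latency_us=LATENCY_US, items_per_unit_per_us=RATE)
one_slice = run_time_us(WORK, units=1, latency_us=LATENCY_US, items_per_unit_per_us=RATE)

# The compute term grows as the slice shrinks; the latency term does not shrink at all.
compute_whole = whole_gpu - LATENCY_US
compute_slice = one_slice - LATENCY_US
```

In this model the slice pays the full latency for 1/80 of the throughput, which is the point above about losing the parallelism but keeping the overhead.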

Would a hypothetical virtualized GPU be competitive with an AVX-512 core in terms of price/performance? I don't know, I haven't done the comparison