by reitzensteinm
2173 days ago
GPUs aren't really arrays of scalar cores. All threads in a warp run in lock step: if one takes a branch, they all step through it, with operations masked off for the threads that didn't take it. Conceptually it's not all that different from AVX-512 with mask registers, except that the vector is even wider and, of course, the programming model differs.
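
To make the masking concrete, here's a rough toy model in plain Python rather than real GPU code. The warp width of 32 and the `warp_branch` helper are my own assumptions for illustration, and actual hardware handles divergence in more sophisticated ways than this.

```python
WARP_SIZE = 32  # assumption: NVIDIA-style warp width


def warp_branch(values):
    """Toy lock-step model of: if v is even, v //= 2, else v = 3*v + 1.

    Returns the per-lane results and how many branch sides the warp
    had to step through as a whole.
    """
    assert len(values) == WARP_SIZE
    # Every lane evaluates the condition; the result is a per-lane mask.
    mask = [v % 2 == 0 for v in values]
    out = list(values)
    passes = 0

    # "Then" side: issued once for the whole warp if any lane needs it.
    # Lanes with a False mask bit keep their old value (masked off).
    if any(mask):
        passes += 1
        out = [v // 2 if m else o for v, m, o in zip(values, mask, out)]

    # "Else" side: same idea with the mask inverted.
    if not all(mask):
        passes += 1
        out = [3 * v + 1 if not m else o for v, m, o in zip(values, mask, out)]

    return out, passes


if __name__ == "__main__":
    _, divergent = warp_branch(list(range(WARP_SIZE)))  # mixed even/odd lanes
    _, uniform = warp_branch([2] * WARP_SIZE)           # all lanes agree
    print(divergent, uniform)
```

The point of the sketch is that a warp whose lanes disagree pays for both sides of the branch, while a warp whose lanes all agree only pays for one.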