Hacker News new | ask | show | jobs
by xrd 820 days ago
Any CUDA primer you recommend in particular? I had this same question.
2 comments

Here's an article on syncing in CUDA via cooperative groups: https://developer.nvidia.com/blog/cooperative-groups/

There's also explicit warp synchronization, i.e. __syncwarp(). More on warp primitives here: https://developer.nvidia.com/blog/using-cuda-warp-level-prim...

Probably https://www.youtube.com/watch?v=nOxKexn3iBo (or just skimming the attached colab).
This is terrific, thanks!