Y
Hacker News
new
|
ask
|
show
|
jobs
by
yzh
1038 days ago
I would recommend the course from Oxford (
https://people.maths.ox.ac.uk/gilesm/cuda/
). Also explore the tutorial section of cutlass (
https://github.com/NVIDIA/cutlass/blob/main/media/docs/cute/...
) if you want to learn more about high performance gemm. OpenAI triton is another good resource if you want to write relatively performant cuda kernels using python for deep learning (
https://openai.com/research/triton
)