by jbs789 438 days ago
Source?
3 comments

More informative than the article: https://xcancel.com/blelbach/status/1902113767066103949

cuTile seems to be NVIDIA's answer to OpenAI Triton.
There is a plethora of packages, including DSLs for compute and MLIR:

https://developer.nvidia.com/how-to-cuda-python

https://cupy.dev/

And

"Zero to Hero: Programming Nvidia Hopper Tensor Core with MLIR's NVGPU Dialect" from 2024 EuroLLVM.

https://www.youtube.com/watch?v=V3Q9IjsgXvA