Hacker News new | ask | show | jobs
by nikonyrh 316 days ago
AMD already has Composable Kernels[1], and supports for example Triton[2]. Then there is also HIP[3], and there are tools to automatically convert from from CUDA to HIP. But since CUDA is the de-facto standard, there is always friction to use something else (unless you need to support also AMD stack).

Making something just CUDA-compatible is non-trivial, and since Nvidia decides its direction and new features then the alternatives would always be lagging behind. Currently there are also major hardware differences between Nvidia and AMD, which may make highly optimized CUDA code inefficient or even buggy.

  [1] https://github.com/ROCm/composable_kernel?tab=readme-ov-file#composable-kernel
  [2] https://github.com/triton-lang/triton?tab=readme-ov-file#triton
  [3] https://github.com/ROCm/HIP?tab=readme-ov-file#what-is-this-repository-for