by almostgotcaught
221 days ago
> and now CUDA is all but irrelevant.

Lol this is so wrong it's cringe.

> There's now so many different and opinionated takes on how you should write high performant accelerator cluster code. I love it.

There are literally only 2: SIMT (ie the same as it always was) and tiles (ie Triton). That's it. Helion is just Triton with more auto-tuning (Triton already has auto-tuning).
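To make the SIMT vs. tiles distinction concrete, here's a rough CPU-only toy in plain Python. It is a sketch of the two programming models, not real CUDA or Triton code: the names `simt_add`, `tile_add`, `threads_per_block`, and `block` are made up for illustration, and the loops stand in for what a GPU would run in parallel.

```python
# Toy emulation of the two models on the CPU. No GPU, CUDA, or Triton API
# is used; every name below is hypothetical and exists only for illustration.

def simt_add(x, y, threads_per_block=4):
    """SIMT style: the kernel body is written for ONE thread and ONE element."""
    n = len(x)
    out = [0] * n
    num_blocks = (n + threads_per_block - 1) // threads_per_block
    for block_idx in range(num_blocks):              # blocks would run in parallel
        for thread_idx in range(threads_per_block):  # so would threads
            i = block_idx * threads_per_block + thread_idx  # global thread index
            if i < n:                                # per-thread bounds check
                out[i] = x[i] + y[i]                 # scalar work per thread
    return out


def tile_add(x, y, block=4):
    """Tile style: the kernel body is written for ONE program and a WHOLE tile."""
    n = len(x)
    out = [0] * n
    num_programs = (n + block - 1) // block
    for pid in range(num_programs):                  # programs would run in parallel
        offsets = [pid * block + k for k in range(block)]
        mask = [o < n for o in offsets]              # guards the ragged last tile
        xs = [x[o] if m else 0 for o, m in zip(offsets, mask)]  # masked load
        ys = [y[o] if m else 0 for o, m in zip(offsets, mask)]  # masked load
        tile = [a + b for a, b in zip(xs, ys)]       # one op over the whole tile
        for o, m, v in zip(offsets, mask, tile):     # masked store
            if m:
                out[o] = v
    return out
```

In the first version you think in terms of one thread's index; in the second you think in terms of a block of offsets plus a mask, and how that tile is spread over threads is left to the compiler. Tile size (`block` here) is the kind of knob an auto-tuner would search over.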