Hacker News new | ask | show | jobs
by fzimmermann89 1786 days ago
So the code looks (apart from pointers) similiar to numba which feels much closer to numpy/pytorch high level code. Are there huge advantages in the triton model compared to numba that I don't see? Or is there a big performance gap? For me numba was always the easiest way to get some new idea running on cuda, and most of the time it was fast enough..

Did anybody find performance comparison between numba and triton?

1 comments

Unlike Numba, Triton operators operate on blocks with explicit load and store of blocks. This is what enables analysis to automate coalescing, shared memory management, etc.

I guess Triton is not for you if Numba is fast enough.