Hacker News new | ask | show | jobs
How to Optimize a CUDA Matmul Kernel for cuBLAS-Like Performance: A Worklog (siboehm.com)
1 points by Areibman 24 days ago