Hacker News new | ask | show | jobs
by techbro92 395 days ago
Cuda optimization actually doesn’t suck that much. I think NSight studio is amazing and super helpful for profiling and identifying bottlenecks in kernels
1 comments

Totally, NSight is great. We do something similar: generate kernels, profile them on real GPUs, then optimize based on that:D