Hacker News new | ask | show | jobs
by c0g 695 days ago
What’s the bottleneck? Once I’ve got the model and data onto the GPU my only cost is launching CUDA kernels right?

Not sure if that blog post is relevant, but even if it is it shows a 3060 gets //way// faster throughout than the igpu it is testing. I suppose I can test this myself by plugging my 3070 into the NVME on my desktop.