by bassp 520 days ago
You can request up to 1024 threads per block, and an SM can hold roughly 1024-2048 resident threads depending on the GPU; each SM can only execute between 32 and 128 threads at a time, though! So you can have a lot more threads assigned to an SM than the SM can run at once.
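To put rough numbers on that, here's a back-of-the-envelope sketch in plain Python (no GPU needed). The specific limits are assumptions picked for illustration, not values for any particular card, and `warps_needed` and `oversubscription` are made-up helper names:

```python
# Back-of-the-envelope arithmetic; all limits below are assumed, illustrative
# values. Real limits depend on the GPU architecture.
WARP_SIZE = 32  # threads per warp on NVIDIA GPUs


def warps_needed(threads: int, warp_size: int = WARP_SIZE) -> int:
    """Number of warps needed to cover `threads`, rounded up."""
    return -(-threads // warp_size)


def oversubscription(resident_threads: int, threads_per_cycle: int) -> float:
    """How many times more threads are resident than can execute at once."""
    return resident_threads / threads_per_cycle


if __name__ == "__main__":
    block_threads = 1024      # assumed block size
    resident_threads = 2048   # assumed per-SM resident thread limit
    threads_per_cycle = 128   # assumed: the upper figure quoted above

    print(warps_needed(block_threads))
    print(warps_needed(resident_threads))
    print(oversubscription(resident_threads, threads_per_cycle))
```

With those assumed numbers, a full block is 32 warps, a full SM holds 64 warps, and only about 1 in 16 resident threads is executing at any moment. The rest are parked, waiting on memory or for a scheduler slot.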

Right, ok. So you mean a handful of warps and not like a plethora of them for no reason.