Hacker News new | ask | show | jobs
by kllrnohj 980 days ago
https://developer.nvidia.com/blog/how-access-global-memory-e...

SIMT still expects coalesced memory access that's close together otherwise performance falls off a cliff

1 comments

Yes, but not all thread in the block need to. As long as you fill a cache line you’re good.