Hacker News new | ask | show | jobs
by pjmlp 875 days ago
Threads groups get exclusive resources in SIMT execution pipelines.
1 comments

And at memory stall they are exchanged with other waiting thread groups.

Just like HT.

Scheduling algorithm is different.

CPUs target low latency (they switch often). GPUs target high troughput (they switch rarely, only when needed).

High troughput algorithms dont have problem with a lot of threads. Low latency algorithms have problem with a lot of threads (they need lot of cache memory because of constant switching).