Hacker News new | ask | show | jobs
by juggertao 875 days ago
And at memory stall they are exchanged with other waiting thread groups.

Just like HT.

1 comments

Scheduling algorithm is different.

CPUs target low latency (they switch often). GPUs target high troughput (they switch rarely, only when needed).

High troughput algorithms dont have problem with a lot of threads. Low latency algorithms have problem with a lot of threads (they need lot of cache memory because of constant switching).