|
|
|
|
|
by mr_octopus
126 days ago
|
|
Most GPU work boils down to a few patterns — map,
reduce, scan. Each one has a known way to assign
threads. So instead of writing a kernel with thread_id: let c = gpu_add(a, b)
let total = gpu_sum(c)
The thread indexing is still there — just handled
by the runtime, like how Python hides pointer math. |
|