Hacker News new | ask | show | jobs
by mr_octopus 126 days ago
Most GPU work boils down to a few patterns — map, reduce, scan. Each one has a known way to assign threads.

So instead of writing a kernel with thread_id:

  let c = gpu_add(a, b)
  let total = gpu_sum(c)
The thread indexing is still there — just handled by the runtime, like how Python hides pointer math.