Hacker News new | ask | show | jobs
by 9q9 2251 days ago
The 6 requirements you list for doing a dot-product on the GPU can be phrased in abstract as a constraint solving problem where the number of thread blocks, the cost of communication etc are parameters.