by zanussbaum
591 days ago
You're definitely right, 80% was a bit of an overestimation, especially with respect to CUDA. It would be cool to see if there's some way to get better access to those lower-level primitives, but I'd be surprised. It does seem like subgroup support is a step in the right direction, though!