by zanussbaum 590 days ago
At least on my M2, the compiled kernel ends up using fast math anyway, so using WGSL's fma didn't change anything about the kernel that actually gets run.

inglor is probably referring to Strassen or Coppersmith–Winograd.
Last I checked, the extra memory operations really hurt in a lot of cases, especially for the more complex ones, but I'm no expert.
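
For anyone who hasn't seen it, here is a rough pure-Python sketch of Strassen's scheme, just to illustrate the tradeoff: 7 recursive block multiplies instead of 8, paid for with a pile of extra block additions and temporaries. The helper names (`add`, `sub`, `split`, `naive`, `strassen`) and the `cutoff` parameter are made up for this illustration, and it assumes square matrices whose size is a power of two. It is not how a tuned GPU kernel would be written.

```python
def add(A, B):
    # Element-wise sum of two equally sized blocks (allocates a new block).
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def sub(A, B):
    # Element-wise difference of two equally sized blocks.
    return [[a - b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def naive(A, B):
    # Textbook O(n^3) product, used as the base case and as a reference.
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def split(M):
    # Cut a square matrix into four quadrants: M11, M12, M21, M22.
    h = len(M) // 2
    return ([r[:h] for r in M[:h]], [r[h:] for r in M[:h]],
            [r[:h] for r in M[h:]], [r[h:] for r in M[h:]])

def strassen(A, B, cutoff=1):
    # Assumes len(A) is a power of two and cutoff >= 1 (illustrative only).
    n = len(A)
    if n <= cutoff:
        return naive(A, B)
    a11, a12, a21, a22 = split(A)
    b11, b12, b21, b22 = split(B)
    # Seven block products; each operand sum/difference is an extra temporary.
    m1 = strassen(add(a11, a22), add(b11, b22), cutoff)
    m2 = strassen(add(a21, a22), b11, cutoff)
    m3 = strassen(a11, sub(b12, b22), cutoff)
    m4 = strassen(a22, sub(b21, b11), cutoff)
    m5 = strassen(add(a11, a12), b22, cutoff)
    m6 = strassen(sub(a21, a11), add(b11, b12), cutoff)
    m7 = strassen(sub(a12, a22), add(b21, b22), cutoff)
    # Recombine the products into the four quadrants of the result.
    c11 = add(sub(add(m1, m4), m5), m7)
    c12 = add(m3, m5)
    c21 = add(m2, m4)
    c22 = add(add(sub(m1, m2), m3), m6)
    top = [left + right for left, right in zip(c11, c12)]
    bottom = [left + right for left, right in zip(c21, c22)]
    return top + bottom
```

Each level does 18 block additions or subtractions on top of the 7 multiplies, which is where the extra memory traffic comes from. In practice you would pick a larger `cutoff` so small blocks fall back to the plain product.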
Oh, in that case it was because I didn't know about them :) Something to try next!