Hacker News new | ask | show | jobs
by gyrovagueGeist 694 days ago
Typically no, although BLAS software engineers occasionally write HPC strassen's implementations and papers about them. In my opinion there's a few reasons why they're not common in BLAS libraries:

- They're a bit less numerically stable, which used to matter more for common BLAS use cases. Less so these days.

- The memory access patterns and algorithm parallelism make it much harder to reach as high a fraction of peak performance as standard GEMM. Matrix size restrictions for recursive mult algorithms is also an issue.

1 comments

One such paper is https://jianyuhuang.com/papers/sc16.pdf; I don't know if there's anything more recent/relevant.