|
|
|
|
|
by gyrovagueGeist
694 days ago
|
|
Typically no, although BLAS software engineers occasionally write HPC strassen's implementations and papers about them. In my opinion there's a few reasons why they're not common in BLAS libraries: - They're a bit less numerically stable, which used to matter more for common BLAS use cases. Less so these days. - The memory access patterns and algorithm parallelism make it much harder to reach as high a fraction of peak performance as standard GEMM. Matrix size restrictions for recursive mult algorithms is also an issue. |
|