by gnufx
1919 days ago
The reference BLAS in Fortran is indeed slow. I'm not aware of any tuned version in Fortran. It might be possible to rewrite the BLIS structure in Fortran and get reasonable performance on, say, Haswell, but not on SKX, if we're talking about x86.
Intel on HPCs.