Hacker News new | ask | show | jobs
by gnufx 1919 days ago
The reference BLAS in Fortran is indeed slow. I'm not aware of any tuned version in Fortran. It might be possible to re-write the BLIS structure in Fortran and get reasonable performance on, say, Haswell, but not on SKX, if we talk x86.
1 comments

>Tuned.

Intel on HPCs.