Hacker News new | ask | show | jobs
by ajtulloch 1610 days ago
The GotoBLAS paper (“Anatomy of High-Performance Matrix Multiplication”, https://www.cs.utexas.edu/users/flame/pubs/GotoTOMS_final.pd...) is really a masterpiece.

It’s worth internalizing almost every single detail if you’re an engineer interested in writing high performance numerical codes on modern hardware.