Hacker News new | ask | show | jobs
by MaxBarraclough 1295 days ago
Or any other highly optimised numerical codebase. From a quick glance at OpenBLAS, it looks like they have a lot of microarchitecture-specific assembly code, with dispatching code to pick out the appropriate implementations.

https://github.com/xianyi/OpenBLAS/blob/02ea3db8e720b0ffb3e2...

https://github.com/xianyi/OpenBLAS/blob/02ea3db8e720b0ffb3e2...