Y
Hacker News
new
|
ask
|
show
|
jobs
by
MaxBarraclough
1295 days ago
Or any other highly optimised numerical codebase. From a quick glance at OpenBLAS, it looks like they have a
lot
of microarchitecture-specific assembly code, with dispatching code to pick out the appropriate implementations.
https://github.com/xianyi/OpenBLAS/blob/02ea3db8e720b0ffb3e2...
https://github.com/xianyi/OpenBLAS/blob/02ea3db8e720b0ffb3e2...