Hacker News new | ask | show | jobs
by anonkogudhyfhhf 1212 days ago
Parallelism using CPU instructions and vectorization most likely. Threading would still improve performance
1 comments

Multiplying a large enough matrix in Python using MKL for Numpy, I can watch the cpu usage go to 400% in top. You may need to run it in a loop or make the matrices quite large, a surprisingly large amount of computation has to happen before it’ll show up in top.