|
|
|
|
|
by remontoire
1106 days ago
|
|
Watching geohot code a general matrix multiply algorithm from 0.9 GFLOPS and optimising it to 100 glops by only tinkering with cache locality, it makes me wonder how much effort should be put into single threaded performance before ever thinking about multi threading |
|
1: https://www.factorio.com/blog/post/fff-204