by vardump
2686 days ago
No, it's faster because the working set of 64 * 64 * 4 * 2 bytes (32 KiB) can (almost) fit in a CPU core's L1 cache. Each further cache level is slower, and main memory is glacially slow by comparison. The WASM example would speed up as well using the same approach. So would C, Rust, or whatever.
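To make the arithmetic concrete, here is a minimal sketch in Rust, assuming "the same approach" means processing the data in 64 x 64 tiles of 4-byte elements with two tiles live at once (one read, one written). The matrix transpose workload and the names `TILE`, `working_set_bytes`, and `transpose_tiled` are hypothetical, picked only for illustration; this is not the code being discussed, and real speedups depend on the actual cache sizes of the machine.

```rust
/// Tile edge length (an assumption): 64 * 64 elements * 4 bytes * 2 tiles = 32 KiB.
const TILE: usize = 64;

/// Bytes touched while one source tile and one destination tile are in flight.
fn working_set_bytes() -> usize {
    TILE * TILE * std::mem::size_of::<f32>() * 2
}

/// Transpose an n x n row-major matrix one TILE x TILE block at a time,
/// so the inner loops only touch data belonging to two tiles.
fn transpose_tiled(src: &[f32], dst: &mut [f32], n: usize) {
    assert_eq!(src.len(), n * n);
    assert_eq!(dst.len(), n * n);
    for row0 in (0..n).step_by(TILE) {
        for col0 in (0..n).step_by(TILE) {
            // Clamp the tile so sizes that are not a multiple of TILE still work.
            let row_end = (row0 + TILE).min(n);
            let col_end = (col0 + TILE).min(n);
            for r in row0..row_end {
                for c in col0..col_end {
                    dst[c * n + r] = src[r * n + c];
                }
            }
        }
    }
}

fn main() {
    let n = 256;
    let src: Vec<f32> = (0..n * n).map(|i| i as f32).collect();
    let mut dst = vec![0.0f32; n * n];
    transpose_tiled(&src, &mut dst, n);
    println!("working set per tile pair: {} bytes", working_set_bytes());
}
```

The untiled version would walk `dst` with a stride of `n` elements across the whole matrix, which is where the slower cache levels and main memory start to dominate.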