|
|
|
|
|
by loup-vaillant
1335 days ago
|
|
> I'm not convinced that AVX 256 and AVX 512 are useful for non-matrix operations. I've observed that compared to regular x86-64 code without SIMD, using AVX 256 speeds up the Chacha20 cipher (for long messages so they can be processed in 512-bytes chuncks (8 blocks)) by a factor of 5. Network packets easily exceed 1KB, and files are usually much bigger. Matrix operations aren't the only viable niche. |
|
https://simdjson.org/