Hacker News
by my123 1689 days ago
Throughput is not an issue on the M1, with 4x 128-bit SIMD units.
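
For a rough sense of scale, here is a back-of-the-envelope sketch of what four 128-bit units could add up to per core. The 3.2 GHz clock, the assumption that every unit issues one fused multiply-add per cycle, and the helper name `peak_gflops` are illustrative assumptions, not figures from the comment above, so treat the result as a theoretical ceiling rather than a measurement.

```python
def peak_gflops(simd_units, vector_bits, element_bits, clock_ghz, flops_per_lane=2):
    """Theoretical per-core peak in GFLOPS.

    flops_per_lane=2 assumes one fused multiply-add (a multiply plus an add)
    per lane per cycle on every unit. This is an assumption for illustration.
    """
    lanes = vector_bits // element_bits  # elements that fit in one vector register
    return simd_units * lanes * flops_per_lane * clock_ghz


# Assumed inputs: 4 units of 128 bits each, 3.2 GHz performance-core clock.
fp32_peak = peak_gflops(simd_units=4, vector_bits=128, element_bits=32, clock_ghz=3.2)
fp64_peak = peak_gflops(simd_units=4, vector_bits=128, element_bits=64, clock_ghz=3.2)

print(f"FP32: {fp32_peak:.1f} GFLOPS per core")  # FP32: 102.4 GFLOPS per core
print(f"FP64: {fp64_peak:.1f} GFLOPS per core")  # FP64: 51.2 GFLOPS per core
```

Real code rarely sustains this, since loads, stores, and dependency chains get in the way, but it shows why the vector width alone is not the bottleneck.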

Neon is certainly not a bad SIMD ISA; it's quite an orthogonal one.

You also have the AMX extension at hand, which is more special-purpose but can deliver very high throughput (on a regular M1: 350 GFLOPS DGEMM, 1.2 TFLOPS SGEMM, without leveraging anything other than the CPU).