by my123
1689 days ago
Throughput is not an issue on the M1, with 4x 128-bit SIMD units. NEON is certainly not a bad SIMD ISA; it's quite an orthogonal one. You also have the AMX extension at hand, which is more special-purpose but allows very high throughput (on a regular M1: 350 GFLOPS DGEMM, 1.2 TFLOPS SGEMM, without leveraging anything other than the CPU).
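For a rough sense of scale, here is a back-of-the-envelope sketch of the theoretical NEON peak implied by "4x 128-bit SIMD units". The 3.2 GHz clock, the count of four performance cores, and counting one fused multiply-add as two floating-point operations are my assumptions, not figures from the comment above, and `peak_gflops` is just a hypothetical helper name.

```python
def peak_gflops(simd_units, vector_bits, element_bits, clock_ghz, cores=1):
    """Theoretical peak GFLOPS, assuming every unit issues one FMA per cycle."""
    lanes = vector_bits // element_bits       # elements per vector register
    flops_per_cycle = simd_units * lanes * 2  # FMA counted as multiply + add
    return flops_per_cycle * clock_ghz * cores


# Assumed values (not from the comment): 3.2 GHz clock, 4 performance cores.
CLOCK_GHZ = 3.2
PERF_CORES = 4

# Double precision: 64-bit elements, so 2 lanes per 128-bit vector.
dp_per_core = peak_gflops(4, 128, 64, CLOCK_GHZ)
dp_all_cores = peak_gflops(4, 128, 64, CLOCK_GHZ, cores=PERF_CORES)

# Single precision: 32-bit elements, so 4 lanes per 128-bit vector.
sp_all_cores = peak_gflops(4, 128, 32, CLOCK_GHZ, cores=PERF_CORES)

print(f"NEON FP64 peak, one core:   {dp_per_core:.1f} GFLOPS")
print(f"NEON FP64 peak, four cores: {dp_all_cores:.1f} GFLOPS")
print(f"NEON FP32 peak, four cores: {sp_all_cores:.1f} GFLOPS")
```

Under those assumptions the NEON-only ceiling comes out to roughly 205 GFLOPS in double precision and 410 GFLOPS in single precision across four cores, both below the DGEMM and SGEMM figures quoted above, which fits the point that AMX is where the extra throughput comes from.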