by llm_nerd
1107 days ago
The Neural Engine on all recent Apple silicon (and A## devices) has "tensor" cores for matrix calculations (note: Apple abstracts all of this behind Core ML, so there is some conflation between the ANE and the AMX instructions/hardware). The M2 Ultra offers 31.6 trillion ops per second with fp16, for instance, which actually bests an A100. The software support is terrible, of course, which is the biggest limitation, but Apple clearly wants to be in that realm as well.