|
|
|
|
|
by cavisne
869 days ago
|
|
It's a good question. I think fundamentally its because no wants to/can compete with Nvidia when making a general purpose parallel processor. They all want to make something a bit more specialized, so they need to guess what functionality is needed and not needed. This is a really tricky guess, case in point that AMD's latest chip cant compete on training because they could not get Flash Attention 2 working on the backward pass because of their hardware architecture. [1] Attempts to abstract at a higher layer have failed so far because that lower layer is really valuable, again Flash Attention is a good example. [1] https://www.semianalysis.com/p/amd-mi300-performance-faster-... |
|