|
|
|
|
|
by Veedrac
2361 days ago
|
|
Could you be more explicit? What about the naïve approach to training (same graph but backwards, computing gradients) is going to fail? Wrt. matmul, if you couldn't split them up, today's AI accelerators wouldn't work full stop. But regardless, even if it was much more complex on CS-1 than on all the other sea-of-multipliers accelerators, it's obviously a problem they've solved and so irrelevant to the compilation issue. |
|