|
|
|
|
|
by kristjansson
708 days ago
|
|
I mean they're building an API to abstract away some of the SKU-to-SKU differences, but the broader point cuts the other way, I think: > In fact, more broadly we believe we should really reorient our ideas of AI around what maps well onto the hardware. How big should a recurrent state be? As big can fit onto an SM. How dense should the compute be? No less so than what the hardware demands. An important future direction of this work for us is to use our learnings about the hardware to help us design the AI to match. The value is in adapting the implementation (either manually at write-time or programmatically at run-time) to the specifics of the hardware. Also, great line: > And we ask: if your matrix multiply is smaller than 16x16, are you sure what you’re doing is AI? |
|