Skimming it quickly I didn’t see anything about gpu or vector instruction support as compilation targets. Is this planned? Did I miss something and this is at a higher layer?
We currently compile to C code and use the system Compiler to compile it further. For dense loop nests it does a good job of auto-vectorizing, but we believe there’s good opportunities for doing something custom, knowing the high-level algebraic structure.
taco does not target GPUs yet, and we want to work on it this spring. It’s clearly needed, for example for neural networks
taco does not target GPUs yet, and we want to work on it this spring. It’s clearly needed, for example for neural networks