It would be easy to push complexity up at the level of Numpy/Pytorch/Tensorflow but it mostly gets hidden
(also a lot of it relies on LAPACK which is Fortran - which kinda works with SIMD better than C/C++)
It would be easy to push complexity up at the level of Numpy/Pytorch/Tensorflow but it mostly gets hidden
(also a lot of it relies on LAPACK which is Fortran - which kinda works with SIMD better than C/C++)