|
|
|
|
|
by openasocket
3195 days ago
|
|
I'm surprised your performance is anywhere near that of standard BLAS implementations. The Golang compiler doesn't have support for explicit SIMD or auto-vectorization, so that's a big performance gain just sitting there. |
|