|
|
|
|
|
by gnufx
2803 days ago
|
|
I don't know about this specific example, but polyhedral optimization generally, at least, is useful for parallelization of such loops, e.g. http://pluto-compiler.sourceforge.net/. However, serial GEMM is typically most important in HPC, with parallelization at a higher level. (GEMM shouldn't be main memory limited, if that's what you mean.) |
|