Hacker News new | ask | show | jobs
by graycat 379 days ago
Fast matrix transpose? Agree for a transposed matrix, just change the indexing arithmetic that converts row i and column j to an offset in the storage for the matrix and then remember that this is a transposed matrix. Some software object semantics could make this easy for other software to use.
1 comments

i think the problem with changing the indexing arithmetic is that you could end up with arithmetic incompatible with vector instructions in hardware that you're hoping to use for parallelism.
> vector instructions

Gee, for the polar decomposition, Gauss-Seidel, etc., looked really hard for those in my IBM PC/XT and couldn't find any!!!