|
|
|
|
|
by bcatanzaro
2149 days ago
|
|
I wrote a library a few years back to do in-place non-square transposition in the GPU register file. The algorithm had a lot of math that needed to be computed for every size matrix. I wrote it in C++03, before any constexpr, so the metaprogramming is intense. One of these days I'll rewrite it in C++20 and most of the code will go away. This code is still in use by the way. https://github.com/bryancatanzaro/trove |
|