|
|
|
|
|
by costrouc
3067 days ago
|
|
For those wondering the lapack guys are working on a new version of lapack to run on heterogeneous architecture including gpus. See their work at http://www.icl.utk.edu/research I have worked with these guys and all I can say is good luck outperforming the highly optimized routines they have written. My bet is that the the guy writing this blog used a non optimized version of lapack. |
|
The performance is completely expected--in fact, I would expect an optimized 5x5 solver to be faster than this.