Hacker News new | ask | show | jobs
by Drakim 821 days ago
I'm surprised this actually works, usually detecting whether to use multiplication or addition is slower than simply using multiplication. Especially if it's massive amounts of work being done in parallel.
1 comments

Wonder how well it compares to openblas and cublas.