|
|
|
|
|
by david-gpu
260 days ago
|
|
> It’s all a bunch of multiply and threshold operations. Real-world high-performance matrix multiplication functions do contain branches internally, even on GPUs. If you are ever curious about what that looks like, NVidia maintains an open-source library called CUTLASS. |
|