by imtringued
21 days ago
Nvidia has added structural sparsity to their GPUs, and every time they pull out a FLOPS or TOPS number, they assume you will use structural sparsity. The die area argument here makes no sense. Supporting structural sparsity can be done either by duplicating the multipliers with and without the support or you have a single general purpose multiplier that does both, in which case you can have twice as many of them. Also, in ReLU^2 networks, 90%+ parameters are zero.
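For anyone unfamiliar with the scheme, here is a rough sketch of the 2:4 pattern used by Nvidia's sparse tensor cores: in every group of four weights, at most two are kept, and each kept value is stored with a small index into its group. The function names (`compress_2_of_4`, `sparse_dot`) and the magnitude-based pruning rule are my own assumptions for illustration; this models the arithmetic only, not the hardware or any Nvidia API.

```python
def compress_2_of_4(weights):
    """Keep the two largest-magnitude weights in each group of four.

    Returns the kept values and their positions (0-3) within each group.
    Magnitude pruning is an assumption made for this sketch.
    """
    assert len(weights) % 4 == 0, "length must be a multiple of 4"
    values, indices = [], []
    for start in range(0, len(weights), 4):
        group = weights[start:start + 4]
        # Rank positions by magnitude, keep the top two, restore index order.
        keep = sorted(sorted(range(4), key=lambda j: -abs(group[j]))[:2])
        values.extend(group[j] for j in keep)
        indices.extend(keep)
    return values, indices


def sparse_dot(values, indices, activations):
    """Dot product that multiplies only the kept weights."""
    total = 0.0
    for k, (value, j) in enumerate(zip(values, indices)):
        group_start = (k // 2) * 4  # two kept values per group of four
        total += value * activations[group_start + j]
    return total


# Hypothetical example data.
weights = [0.1, -2.0, 0.0, 3.0, 1.0, 0.5, -0.2, 0.0]
activations = [1.0, 1.0, 1.0, 1.0, 2.0, 2.0, 2.0, 2.0]
values, indices = compress_2_of_4(weights)
result = sparse_dot(values, indices, activations)
```

The sparse path does four multiplies where the dense one does eight, which is where the doubled headline throughput comes from. The index lookup that picks the matching activation is the extra selection logic the sparse path needs.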
Any logic you add to the GPU is physical silicon and metal that take up physical space.
> duplicating the multipliers with and without the support or you have a single general purpose multiplier that does both
That would be extra physical logic, which would take up extra physical space on the die. Whether it "can be done" isn't my point; my point is that doing it requires die area.