|
|
|
|
|
by FrereKhan
1836 days ago
|
|
If you bring activation sparsity into the mix, the advantage of SNN processors over GPUs/TPUs becomes more clear. Loss-gradient-based optimisation approaches are great because they give you a tool to include e.g. sparsity regularisation into the loss. Encouraging sparse activity makes simple linear algebra a poor fit for network activation, and SNN processors a much better fit. |
|
Sparse activations that don't also have a time component (i.e. are sparse in space and time) can be very well implemented without events.
Granted, SNN processors can handle sparse activations better than matrix accelerators. But then again, SNN accelerators might carry lots of SNN overhead that is not required for sparse activations alone.
Edit: A good example for a non-spiking sparse activation accelerator is the NullHop architecture [1].
[1] https://ieeexplore.ieee.org/abstract/document/8421093