Hacker News new | ask | show | jobs
by salty_biscuits 573 days ago
I'm sure there is a way of interpreting a relu as a sparsity prior on the layer.