Hacker News
Why Batch Norm Causes Exploding Gradients [2020] (kyleluther.github.io)
1 point by qwertyforce 838 days ago