Hacker News new | ask | show | jobs
by ogrisel 1436 days ago
There is also this very interesting 2017 paper:

Towards Understanding Generalization of Deep Learning: Perspective of Loss Landscapes

Lei Wu, Zhanxing Zhu, Weinan E

https://arxiv.org/abs/1706.10239

I think it was the first paper to study the volume of the basins of attraction of good global minima and used the poisoning scheme to highlight the frequency of bad global minima that are typically not reachable found via SGD on the original dataset without poisoning.