|
|
|
|
|
by ogrisel
1436 days ago
|
|
There is also this very interesting 2017 paper: Towards Understanding Generalization of Deep Learning: Perspective of Loss Landscapes Lei Wu, Zhanxing Zhu, Weinan E https://arxiv.org/abs/1706.10239 I think it was the first paper to study the volume of the basins of attraction of good global minima and used the poisoning scheme to highlight the frequency of bad global minima that are typically not reachable found via SGD on the original dataset without poisoning. |
|