|
|
|
|
|
by mendeza
2782 days ago
|
|
Are there any good guides, tutorials, or research papers that investigate or advise how to inspect weights during training for debugging. The only things I read are to watch out for vanishing gradients, and when fine-tuning the most change in layers are seen toward the end of the network, not the beginning layers. |
|
https://arxiv.org/pdf/1706.04454.pdf
https://openreview.net/forum?id=ByeTHsAqtX