by PoignardAzur
1041 days ago
> I think you may be missing the extensive lines of research covering those topics. Memorization vs Generalization

I meant this specific analysis: that over-parameterized neural networks will at first memorize but, if they keep training on the same dataset with weight decay, will eventually generalize. Then again, maybe there have been analyses done on this subject that I wasn't aware of.

Do you have a link to a specific post you're thinking of? It's likely going to be in a Tishby-like lineage (the classic paper is from 2015: https://arxiv.org/abs/1503.02406, with much more work going back into the early aughts, just outside of the NN regime IIRC), but I'm happy to look to see if it's novel.