Hacker News new | ask | show | jobs
by tbalsam 1044 days ago
I would start with Shannon's information theory and the Wikipedia page on L2/the MDL as a decent starting point.

For the first, there are a few good papers that simplify the concepts even further.

1 comments

Sorry, I know what MDL and L2 regularization are, I would like the paper that connects them in the way you mentioned