by snrji 2609 days ago
For linear models it can be shown to be equivalent to weight decay. For nonlinear ones, it empirically behaves as a regularizer.
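The comment doesn't say what "it" refers to, so here's a rough sketch under one loud assumption: that the technique is additive Gaussian noise on the inputs, which is only one of several methods that have this property. For linear least squares, the expected noisy loss E||y - (X + E)w||^2 works out to ||y - Xw||^2 + n * sigma^2 * ||w||^2, which is an L2 (weight decay) penalty. The function names, `sigma`, and the sample count below are made up for illustration.

```python
import numpy as np


def noisy_loss_mc(X, y, w, sigma, n_samples=20000, seed=0):
    """Monte Carlo estimate of E||y - (X + E) w||^2, with E ~ N(0, sigma^2) i.i.d.

    ASSUMPTION: "it" in the comment is taken to be Gaussian input noise;
    the original thread does not confirm this.
    """
    rng = np.random.default_rng(seed)
    # One independent noise matrix per sample: shape (n_samples, n, d).
    noise = rng.normal(0.0, sigma, size=(n_samples,) + X.shape)
    # Broadcasting: (n_samples, n, d) @ (d,) -> (n_samples, n).
    residuals = y - (X + noise) @ w
    return float((residuals ** 2).sum(axis=1).mean())


def weight_decay_loss(X, y, w, sigma):
    """Closed form: squared error plus an L2 penalty with strength n * sigma^2."""
    n = X.shape[0]
    r = y - X @ w
    return float(r @ r + n * sigma ** 2 * (w @ w))


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.normal(size=(20, 3))
    y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=20)
    w = rng.normal(size=3)  # any weight vector, not necessarily the optimum
    print("noisy (Monte Carlo):", noisy_loss_mc(X, y, w, sigma=0.5))
    print("weight decay (exact):", weight_decay_loss(X, y, w, sigma=0.5))
```

The two numbers should agree up to Monte Carlo error for any `w`, which is why minimizing one is the same as minimizing the other in the linear case. Nothing here carries over to nonlinear models, where the claim is only empirical.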