Hacker News new | ask | show | jobs
by nborwankar 1061 days ago
Shouldn’t this be called Regularized SoftMax? Adding 1 in the denominator looks a lot like a regularization in other ML contexts.