|
|
|
|
|
by p1esk
2832 days ago
|
|
If you don't undestand where the name "softmax" came from, you don't really understand what it is. Softmax is a differentiable approximation of the max function. Plot max(0, x) and softmax(0, x) functions, and it should become clear. |
|
Yeah it makes sense that this is a super important function, but I still feel like one could just remember the principle that "exponentiation followed by normalization is a smooth approximation to maxarg."