by throwaway080383
2824 days ago
Nit: it seems to be more of a smooth approximation to argmax than to max. Yeah, it makes sense that this is a super important function, but I still feel like one could just remember the principle that "exponentiation followed by normalization is a smooth approximation to argmax."
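A minimal sketch of that principle, in case it helps to see it concretely. The `temperature` parameter and the sample scores are assumptions added for illustration, not something from the comment above; shrinking the temperature just makes the output look more like a one-hot argmax indicator.

```python
import math

def softmax(xs, temperature=1.0):
    """Exponentiate, then normalize. `temperature` is a hypothetical knob
    added here to show the limit behaviour; 1.0 gives plain softmax."""
    # Subtracting the max leaves the result unchanged but avoids overflow.
    m = max(xs)
    exps = [math.exp((x - m) / temperature) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

scores = [1.0, 2.0, 5.0]                    # made-up example scores
soft = softmax(scores)                      # smooth weights summing to 1
sharp = softmax(scores, temperature=0.01)   # nearly one-hot at the largest score
```

The output is a weight per position rather than a single value, which is why it reads as a soft version of argmax (which index is largest) and not of max (what the largest value is).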