Hacker News new | ask | show | jobs
by Tenobrus 2826 days ago
No, it's not. It's actually e ^ x_i / sum(e ^ x_j for x_j in x), which is in fact different. Simply dividing by the sum wouldn't work for "squashing to a probability distribution" in a large number of cases.
1 comments

So pointwise exponentiation composed with dividing by the sum. Still don't need a new word.