Hacker News new | ask | show | jobs
by antupis 2355 days ago
changing softmax to something else might fix that when there is a limited number of good moves softmax is far from optimal.