Hacker News
by mjaskowski 3745 days ago
Absolutely. Q-learning has this capability, and back in 1992 a shallow neural network trained with a closely related temporal-difference method was used to play backgammon, which has a lot of stochasticity. See
https://en.wikipedia.org/wiki/TD-Gammon
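To make the point about stochasticity concrete, here is a minimal tabular Q-learning sketch on a made-up two-state problem. It is not TD-Gammon (no neural network, no backgammon); the environment, the 0.8 transition probability, the rewards, and the name `q_learning_demo` are all assumptions chosen for illustration. The "risky" action pays off only 80% of the time, and the learned value should settle near that expectation.

```python
import random

def q_learning_demo(episodes=5000, gamma=1.0, seed=0):
    """Tabular Q-learning on a hypothetical toy problem with a random transition."""
    rng = random.Random(seed)
    SAFE, RISKY = 0, 1
    q = {0: [0.0, 0.0], 1: [0.0, 0.0]}       # q[state][action]
    visits = {0: [0, 0], 1: [0, 0]}          # used for a 1/n step size

    def step(state, action):
        """Return (reward, next_state); next_state None means terminal."""
        if state == 0:
            if action == SAFE:
                return 0.5, None             # sure thing: reward 0.5, done
            # RISKY: reach state 1 with probability 0.8, else end with nothing
            return (0.0, 1) if rng.random() < 0.8 else (0.0, None)
        return 1.0, None                     # state 1: reward 1, done

    for _ in range(episodes):
        state = 0
        while state is not None:
            # Uniformly random behaviour; Q-learning is off-policy, so it
            # still estimates the values of the greedy policy.
            action = rng.randrange(2)
            reward, nxt = step(state, action)
            bootstrap = gamma * max(q[nxt]) if nxt is not None else 0.0
            target = reward + bootstrap
            visits[state][action] += 1
            alpha = 1.0 / visits[state][action]
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
    return q

q = q_learning_demo()
print(q[0])  # [value of SAFE, value of RISKY] in the start state
```

Under these assumptions the risky action is worth about 0.8 in expectation versus 0.5 for the safe one, and the averaged updates absorb the randomness without any special handling.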