Hacker News
AlphaSnake: Policy Iteration on a Nondeterministic NP-Hard MDP (arxiv.org)
1 point by kenny239 298 days ago