Hacker News
by sterlind 2009 days ago
That one would be interesting to try with AlphaZero. Reinforcement learning systems tend to have trouble with imperfect information.