Hacker News new | ask | show | jobs
by soohyung 2672 days ago
I believe AlphaGo Zero used reinforcement learning. I would say that's quite an impressive application of it.