by andy_xor_andrew 362 days ago
The article mentions that AlphaGo/Mu/Zero was not based on Q-Learning. I'm no expert, but I thought AlphaGo was based on DeepMind's "Deep Q-Learning"? Is that not right?
1 comment

DeepMind's earlier success with Atari was based on off-policy Q-Learning.