Y
Hacker News
new
|
ask
|
show
|
jobs
by
andy_xor_andrew
362 days ago
The article mentions AlphaGo/Mu/Zero was not based on Q-Learning - I'm no expert but I thought AlphaGo was based on DeepMind's "Deep Q-Learning"? Is that not right?
1 comments
energy123
362 days ago
DeepMind's earlier success with Atari was based on offline Q-Learning
link