| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by andy_xor_andrew 409 days ago
	The article mentions AlphaGo/Mu/Zero was not based on Q-Learning - I'm no expert but I thought AlphaGo was based on DeepMind's "Deep Q-Learning"? Is that not right?

1 comments

DeepMind's earlier success with Atari was based on offline Q-Learning