| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by botw 3538 days ago
	Isn't this approach similar to or used in DeepMind's AlphaGo where the policy network is corresponding to high-level representational knowledge(kind of expert system in traditional AI), and the value network is corresponding to the decision making part(reinforcement learning)?