Hacker News new | ask | show | jobs
by botw 3538 days ago
Isn't this approach similar to or used in DeepMind's AlphaGo where the policy network is corresponding to high-level representational knowledge(kind of expert system in traditional AI), and the value network is corresponding to the decision making part(reinforcement learning)?