|
|
|
|
|
by botw
3538 days ago
|
|
Isn't this approach similar to or used in DeepMind's AlphaGo where the policy network is corresponding to high-level representational knowledge(kind of expert system in traditional AI), and the value network is corresponding to the decision making part(reinforcement learning)? |
|