|
|
|
|
|
by mindgam3
2462 days ago
|
|
> They are designed to play one combinatorial game really well. Maybe, but they’re certainly not described that way by whoever is in charge of publishing DeepMind’s research: “A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play” https://deepmind.com/research/publications/general-reinforce... |
|
The learning algorithm AlphaGo uses is somewhat general, and can handle different games (e.g. you can put chess or Go through the algorithm and it functions well for either).
The output of this algorithm, however, is a specialised agent. The agent is not general. If I create a chess agent and give it Go or chess with different rules, it will perform very poorly.
Creating general learning algorithms is arguably a somewhat easier task than creating a general agent, since learning algorithms are typically run for a long time while an agent often has to make time constrained decisions.
The holy grail of AGI is to make the learning algorithm and the agent the same thing, and have them be general. Then you have an agent which can rapidly adapt to its environment and self-modify as needed. We are still a long way off a system that would do this in terms of current research.