Hacker News new | ask | show | jobs
by daveguy 3020 days ago
They showed that the system could be adapted to learn any two player perfect info game.

Once it has learned Go it won't play chess. Also, it has to be manually adapted.

By this logic minimax has also been shown to do the same:

https://en.m.wikipedia.org/wiki/Minimax

1 comments

Minimax is a general rule on how to optimally play 2 player games. It's about as much of a learning algorithm, and as practical, as "buy low, sell high".

In pure form, it can't be applied to any game where you cannot enumerate every last possible board position (so Chess and Go are out). And with heuristics, those heuristics effectively become the algorithm and they're really complex.

The minimax principle is used in chess and go and any other competitive two player perfect knowledge game. What improves is the selection of moves to evaluate and the quality of the board evaluation. Chess and Go absolutely use minimax with sleight, but important, modifications. Deep learning extensions with Alpha Go were specifically for move selection and board evaluation.

Alpha beta pruning with heuristics is one of the most fundamental extensions of minimax. It is still minimax.

The alpha go system still clearly applies only to perfect information 2 player games, because it is fundamentally extended minimax.

Edit: I'm not saying the heuristic extensions of alpha go, or the reinforcement learning of zero, aren't brilliant and important. That doesn't take away from the basic fact that it is an algorithm for perfect info 2 player games and one of them at a time (even if modifications can adapt it to other games). Heuristics will be fundamental to general intelligence, but alpha go is not general intelligence.