|
|
|
|
|
by JoeDaDude
2197 days ago
|
|
> ...learning completely on its own giving it nothing but the rules which is how AlphaGo works... Not to be too picky, but it was AlphaGo _Zero_ that learned from the rules alone. AlphaGo learned from a large database of human played games: "...trained by a novel combination of supervised learning from human expert games". [1] AlphaGo Zero, derived from AlphaGo, was "an algorithm based solely on reinforcement learning, without human data, guidance or domain knowledge beyond game rules". [2] [1] https://www.nature.com/articles/nature16961 [2] https://pubmed.ncbi.nlm.nih.gov/29052630/ |
|
https://en.wikipedia.org/wiki/AlphaGo_Zero https://en.wikipedia.org/wiki/AlphaZero