| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by JoeDaDude 2197 days ago

> ...learning completely on its own giving it nothing but the rules which is how AlphaGo works...

Not to be too picky, but it was AlphaGo _Zero_ that learned from the rules alone. AlphaGo learned from a large database of human played games: "...trained by a novel combination of supervised learning from human expert games". [1]

AlphaGo Zero, derived from AlphaGo, was "an algorithm based solely on reinforcement learning, without human data, guidance or domain knowledge beyond game rules". [2]

[1] https://www.nature.com/articles/nature16961

[2] https://pubmed.ncbi.nlm.nih.gov/29052630/

1 comments

klipt 2197 days ago

Also AlphaGo Zero never played chess, only go. It was AlphaZero that applied the same framework to other games including chess.

https://en.wikipedia.org/wiki/AlphaGo_Zero https://en.wikipedia.org/wiki/AlphaZero

link