by simonh 2198 days ago
AlphaGo didn't learn from zero either. It has a pre-processor that identifies sets of patterns with known features, and also:

"AlphaGo was initially trained to mimic human play by attempting to match the moves of expert players from recorded historical games, using a database of around 30 million moves".

2 comments

That's for an earlier system (which also used less compute).

AlphaGo was followed by AlphaGo Zero (the topic of this article), which did not use the process you describe; it used only the rules of the game and the winning condition.

Oops, my mistake. Thanks for the correction.
AlphaGo != AlphaGo Zero