| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by FartyMcFarter 2197 days ago
	Go champions don't learn from zero. They learn from teachers, books, and playing against each other. This knowledge is built over hundreds, or thousands of years.

3 comments

simonh 2197 days ago

Alphago didn't learn from zero either. It has a pre-processor that identifies sets of patterns with known features, and also:

"AlphaGo was initially trained to mimic human play by attempting to match the moves of expert players from recorded historical games, using a database of around 30 million moves".

link

PeterisP 2197 days ago

That's for an earlier system (which also used less compute).

AlphaGo was followed by AlphaGo Zero (which is the topic of this article) which did not use the process that you describe, it used only the rules of the game and the winning condition.

link

simonh 2197 days ago

Oops, my mistake. Thanks for the correction.

link

joycian 2197 days ago

AlphaGo != AlphaGo Zero

link

hjnilsson 2197 days ago

Yes! So perhaps one way to make the machine more efficient, is by one of pre-programmed “general” models, that can be attuned to a particular problem in a much shorter time?

link

29athrowaway 2197 days ago

And long collaborative study sessions.

link