| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by yters 3132 days ago
	It'd only be innovative if they'd discovered a general approach that applies to many problems without tweaking, and even better if it learned from a comparable size problem set as humans do. As it is, even something as generic as AlphaGo Zero is highly customized for the particular problem domain, and requires millions of games.

1 comments

forgot-my-pw 3131 days ago

The improvements from AlphaGo -> AG Master -> AG Zero is by adding more generalization techniques and rely less on human intervention/data. AlphaGo Zero learns only from self training.

AGZ probably can be retrained to other board games, but the hardware cost to train is quite expensive. The estimated cost to train AGZ (for 40 days?) was $25M.

link

yters 3131 days ago

The AGZ algorithm is picked particularly for the sort of game that Go is.

link