Hacker News new | ask | show | jobs
by yters 3132 days ago
It'd only be innovative if they'd discovered a general approach that applies to many problems without tweaking, and even better if it learned from a comparable size problem set as humans do. As it is, even something as generic as AlphaGo Zero is highly customized for the particular problem domain, and requires millions of games.
1 comments

The improvements from AlphaGo -> AG Master -> AG Zero is by adding more generalization techniques and rely less on human intervention/data. AlphaGo Zero learns only from self training.

AGZ probably can be retrained to other board games, but the hardware cost to train is quite expensive. The estimated cost to train AGZ (for 40 days?) was $25M.

The AGZ algorithm is picked particularly for the sort of game that Go is.