Hacker News new | ask | show | jobs
by forgot-my-pw 3125 days ago
The improvements from AlphaGo -> AG Master -> AG Zero is by adding more generalization techniques and rely less on human intervention/data. AlphaGo Zero learns only from self training.

AGZ probably can be retrained to other board games, but the hardware cost to train is quite expensive. The estimated cost to train AGZ (for 40 days?) was $25M.

1 comments

The AGZ algorithm is picked particularly for the sort of game that Go is.