I don't get the mystery of this. The algorithm is complex, sure! But deep learning is just very fast training / repetition of a game (or some other goal) while saving the good or bad results: predict user moves, find good positions/patterns. Or did I miss something here?
I'd be interested in how strong it would be if given the same constraints as human learning (playing thousands of games, rather than millions).