| HN Mirror

For 4. Overall, the goals are similar. Both are trying to optimize over the game to try and reach a winning state. But the methods are different.

The Deep Blue algorithm just does a search over the tree of possible moves (up to a certain depth), called alpha-beta search, and picks the best position it can find according to a simple evaluation function.

Searching over all these nodes is effective because chess is not random. If it could go over all possible chess positions, it could find the perfect moves to play every time. It doesn't need any training. The problem is that this search grows exponentially as depth increases, and you need to search up to high depths because chess is complex.

For a long time Stockfish, a powerful chess engine, used only a highly optimized alpha-beta search to perform at a very high level.

Now, there is the approach of reinforcement learning for chess like AlphaZero, which is very similar to this reinforcement learning approach.

And there is a mix between the two, like the current Stockfish, which uses a neural network trained on data along with alpha-beta search.