Hacker News new | ask | show | jobs
by fancyfredbot 457 days ago
This page covers the basics which are pretty timeless. You'd want to understand this before learning about any more recent advances.

It doesn't get into state of the art algorithms, for example proximal policy optimisation isn't mentioned although the paper on this was published in 2017 and is probably considered the best algorithm for at least some applications.