2. Reinforcement Learning an introduction, Sutton & Barto
3. For the implementation: CleanRL is pretty neat for understanding
4. David Silver's RL playlist