Hacker News new | ask | show | jobs
by mlechha 2884 days ago
They're probably the most fundamental kind of reinforcement learning algorithms. Understanding bandit algorithms is crucial to developing a good understanding of RL.