Hacker News
by kmike84 3628 days ago
We're using it for web crawling: you define what to look for (a reward function), and the crawler learns how to reach those pages without wasting too many HTTP requests on irrelevant content. No neural nets, just Q-learning with linear function approximation, plus some common tricks like double learning and experience replay.
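For anyone curious what that combination looks like, here is a minimal sketch, not our actual crawler code. It assumes each candidate link is described by a small feature vector, that Q(link) is a dot product of weights and features, and that following a link yields a reward plus the feature vectors of the links found on the fetched page. All names (`DoubleLinearQ`, `remember`, `replay_step`) and the two toy features are made up for illustration.

```python
import random
from collections import deque


def dot(w, x):
    """Linear Q-value: weights times features."""
    return sum(wi * xi for wi, xi in zip(w, x))


class DoubleLinearQ:
    """Double Q-learning with linear function approximation and a replay buffer."""

    def __init__(self, n_features, alpha=0.1, gamma=0.5, buffer_size=1000, seed=0):
        # Two independent weight vectors (the "double" part).
        self.w = [[0.0] * n_features, [0.0] * n_features]
        self.alpha = alpha      # learning rate
        self.gamma = gamma      # discount factor
        self.replay = deque(maxlen=buffer_size)
        self.rng = random.Random(seed)

    def q(self, phi):
        # Score a link with the average of both estimators.
        return 0.5 * (dot(self.w[0], phi) + dot(self.w[1], phi))

    def remember(self, phi, reward, next_phis):
        # phi: features of the followed link; next_phis: features of links
        # found on the fetched page (empty list means nothing to follow).
        self.replay.append((phi, reward, next_phis))

    def _update(self, phi, reward, next_phis):
        # Pick one estimator to update; it selects the best next link,
        # the other estimator evaluates that link.
        i = self.rng.randrange(2)
        learn, other = self.w[i], self.w[1 - i]
        target = reward
        if next_phis:
            best = max(next_phis, key=lambda p: dot(learn, p))
            target += self.gamma * dot(other, best)
        error = target - dot(learn, phi)
        for k, x in enumerate(phi):
            learn[k] += self.alpha * error * x

    def replay_step(self, batch_size=16):
        # Experience replay: learn from a random sample of past transitions.
        batch_size = min(batch_size, len(self.replay))
        for transition in self.rng.sample(list(self.replay), batch_size):
            self._update(*transition)


# Toy data: features are [bias, link_text_contains_keyword].
# Links with the keyword led to relevant pages (reward 1), others did not.
agent = DoubleLinearQ(n_features=2)
for _ in range(50):
    agent.remember([1.0, 1.0], 1.0, [])
    agent.remember([1.0, 0.0], 0.0, [])
for _ in range(200):
    agent.replay_step()

relevant_score = agent.q([1.0, 1.0])
irrelevant_score = agent.q([1.0, 0.0])
```

After training on the toy data, `relevant_score` should end up well above `irrelevant_score`, so a crawler that follows the highest-scoring link first would spend its requests on the keyword links. A real setup would need richer link features and some exploration (e.g. epsilon-greedy), which this sketch leaves out.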