|
|
|
|
|
by kmike84
3628 days ago
|
|
We're using it for web crawling: define what to look for (a reward function), and crawler can learn how to get these pages from the web without wasting too much HTTP requests for irrelevant content. No neural nets, just Q-Learning with linear function approximation, with some common tricks like double learning and experience replay. |
|