Hacker News
by aaronjg 3393 days ago
This sort of approach is also used in reinforcement learning: https://arxiv.org/abs/1702.01182