Hacker News
by marcosdumay 1108 days ago
By the freshness of training with some data?

Well, aren't they? I believe any kind of reinforcement learning is supposed to be biased toward the most recent training set.
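
A rough sketch of the kind of bias I mean, assuming a constant-step-size incremental update (a common form for value estimates in tabular RL). The function name, step size, and reward batches are made up for illustration, and this only shows one simple case, not every RL method:

```python
def constant_step_estimate(rewards, alpha=0.5, initial=0.0):
    """Apply Q <- Q + alpha * (r - Q) once per reward, with a fixed step size.

    With a fixed alpha, the i-th most recent reward gets weight
    alpha * (1 - alpha)**i, so older samples fade out geometrically.
    """
    q = initial
    for r in rewards:
        q += alpha * (r - q)
    return q


# Hypothetical data: an older batch that always paid 0,
# followed by a newer, smaller batch that always paid 1.
old_batch = [0.0] * 20
new_batch = [1.0] * 5
all_rewards = old_batch + new_batch

estimate = constant_step_estimate(all_rewards)
plain_mean = sum(all_rewards) / len(all_rewards)

print(f"constant-step estimate: {estimate:.5f}")  # 0.96875
print(f"plain mean:             {plain_mean:.5f}")  # 0.20000
```

The newest batch is only a fifth of the data, yet the estimate ends up almost entirely determined by it, while the plain average stays at 0.2. A decaying step size such as 1/n would recover the plain mean instead, so how strong the bias is depends on that choice.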