by Ericson2314
377 days ago
https://news.ycombinator.com/item?id=44280505 I think that thread might help? Total layman here, but maybe some tasks are "uniform" despite being "deep", in such a way that poor samples still suffice? I would call those "ergodic" tasks. But surely there are other tasks where this is not the case?
There are situations where the number of states grows much more slowly than exponentially.
Those situations are a good fit for Q-learning.
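
To make the growth-rate point concrete, here is a rough sketch that counts how many distinct states are reachable within a given number of steps in two toy tasks. Both tasks, and the names `reachable_states`, `tree_step`, `grid_step` and `MOVES`, are made up for illustration and are not from the linked thread. In the "tree" task the state is the whole action history, so the count grows exponentially; in the "grid" task the state is just a position, so it only grows quadratically, which is the kind of case where a Q-table stays small.

```python
def reachable_states(step, start, actions, depth):
    """Return the cumulative number of distinct states seen after 0..depth steps."""
    seen = {start}
    frontier = {start}
    counts = [1]
    for _ in range(depth):
        # Expand every frontier state with every action, keep only new states.
        frontier = {step(s, a) for s in frontier for a in actions} - seen
        seen |= frontier
        counts.append(len(seen))
    return counts


# Hypothetical "tree" task: the state is the full history of actions taken.
def tree_step(state, action):
    return state + (action,)


# Hypothetical "grid" task: the state is only an (x, y) position.
def grid_step(state, action):
    x, y = state
    dx, dy = action
    return (x + dx, y + dy)


MOVES = [(1, 0), (-1, 0), (0, 1), (0, -1)]

tree_counts = reachable_states(tree_step, (), MOVES, 6)
grid_counts = reachable_states(grid_step, (0, 0), MOVES, 6)

print("tree:", tree_counts)  # grows like 4**depth
print("grid:", grid_counts)  # grows like depth**2
```

With the same four actions and the same depth, the tree task ends up with thousands of states while the grid task has fewer than a hundred, so a table with one row per state is only practical in the second case.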