|
|
|
|
|
by jimfleming
2643 days ago
|
|
My point is that DQN is pretty far removed from the biological equivalent. It's impressive and useful but the main reason it succeeded was not because of some deep insight from neuroscience but because it scaled well (or at least better than alternatives at the time). EDIT: Richard Sutton (largely credited as the grandfather of RL) has written about this recently: http://incompleteideas.net/IncIdeas/BitterLesson.html |
|