|
|
|
|
|
by naturalgradient
2903 days ago
|
|
This is a weirdly shallow article containing lots of diagrams and bullet points to just summarize the known points that RL needs a lot of data and needs to learn from scratch. No mention of all the ongoing work in learning from demonstrations, or more generally incorporating any off-policy knowledge. Vague speculations about the philosophy of model free learning. Not really worth the read (as someone working in RL). |
|
Says as much at the end... to be fair we did warn up front "The first part, which you're reading right now, will set up what RL is and why it is fundamentally flawed. It will contain some explanation that can be skipped by AI practitioners." But personally I think the board game allegory is fun and that most people tend to forget the categorical simplicity of Go and Atari games and overhype ; easy to say the main points are not new but the details are important here.