|
|
|
|
|
by mark_l_watson
2679 days ago
|
|
Nice, another large contribution to the field from Google just a day after Open AI’s paper on better language models and their implications. This is in addition to other nice recent public contributions from Uber, Facebook, Microsoft, etc. I think I understand these huge tech company’s “generosity”: these public contributions to the field probably help in recruiting efforts like salary and fringe benefits do. The field is moving so fast and growing so fast it is difficult to hire talent right now (I manage a machine learning team at a very large company, and at least this is my experience). This paper is claiming a 5000 times increase in performance over previous state of the art techniques. Huge. |
|
These lines of ideas is not new. The main problems associated with it are that it is almost always more computationally expensive (you learn from real and dreamed trajectories) and it is harder to learn as it is susceptible to a kind of exposure bias : Once you have built a model like "the earth is flat", then you will simulate/dream trajectory according to it, diluting the weak evidence you can get from real data telling you that the "earth is round", and so it gets stuck with a wrong model.
The performance gain you refer to is a gain relative to a naive way of doing things i.e. working in pixel space.
Don't get me wrong, I'm a big fan of the model based approach, and every small step in this direction is good as it helps with explain-ability. This paper is one of these nice small steps, but doesn't compare to the gain of previous techniques like experience-replay, or hindsight-experience-replay.