by gwern
1495 days ago
Gato, a Decision Transformer on steroids, is pretty much what you would expect, with the expected RL scaling curves†, if you've been following ML scaling research for the past 2 years. It is, however, still mindblowing to see it in reality. And note that it's only as small (and thus, weak) as it is because they want to run it directly on robots ("We focus our training at the operating point of model scale that allows real-time control of real-world robots, currently around 1.2B parameters").

† https://storage.googleapis.com/deepmind-media/A%20Generalist... looks just like any scaling curve from a text or vision paper...

Also submitted at https://news.ycombinator.com/item?id=31355657
It was a good read, thanks!