Hacker News new | ask | show | jobs
by izzygonzalez 1488 days ago
I made some concept maps of the first parts of the paper. It might help with clarifying some of it.

https://twitter.com/izzyz/status/1525099159925116928

2 comments

Hmm 1.2B params... that's what, roughly 5 GB of VRAM? Surprisingly compact.
This was done so that the IRL robot manipulation tasks could be done fast enough. In the future, we may always need small models mixed with large models for some tasks (e.g., for slow long term planning and fast short term planning), though compute does have a tendency to improve exponentially...
Aha, ok