|
|
|
|
|
by GlenTheMachine
998 days ago
|
|
Ehhh. Solving any one problem with robotic manipulation isn’t all that hard. It takes a lot of trial and error, but in general if the task is constrained you can solve it reliably. The trick is to solve *new* tasks without resorting to all that fine tuning every time. Which is what Russ is claiming here. He’s training an LLM with a corpus of one-off policies for solving specific manipulation tasks, and claiming to get robust ad hoc policies from it for previously unsolved tasks. If this actually works, it’s pretty important. But that’s the core claim: that he can solve ad hoc tasks without training or hand tuning. |
|
In my opinion that really is a good definition of intelligence, and puts this technique at the forefront of machine intelligence.