The differences are pretty big, but the simplest way to illustrate is to try to use gazebo, isaac...etc, and then try to build a whole physically interactive kitchen.
First off, it's gonna take you 3 months to author that thing, if you don't ragequit along the way.
Then, when you go to run, your "50 million steps per second" sim becomes 500 steps per second.
The reason we have robots doing backflips and acrobatics instead of actually useful stuff like picking up your house is making the scenes and getting the data is tough. It requires sensors like cameras and rendering, vs purely proprioceptive-only envs with a flat ground plane and no other physics interactions.
Right now, the industry is doing manual teleop to collect data because it's straight up easier than trying to build these sorts of things in simulators.
The differences are pretty big, but the simplest way to illustrate is to try to use gazebo, isaac...etc, and then try to build a whole physically interactive kitchen.
First off, it's gonna take you 3 months to author that thing, if you don't ragequit along the way.
Then, when you go to run, your "50 million steps per second" sim becomes 500 steps per second.
The reason we have robots doing backflips and acrobatics instead of actually useful stuff like picking up your house is making the scenes and getting the data is tough. It requires sensors like cameras and rendering, vs purely proprioceptive-only envs with a flat ground plane and no other physics interactions.
Right now, the industry is doing manual teleop to collect data because it's straight up easier than trying to build these sorts of things in simulators.
This is why we're building Lucky.