Hacker News
by orasis 2435 days ago
The trick here isn’t “accurate” simulation; it’s that they ran many different simulations with randomly perturbed physics, and the RL learned policies that worked across this wide range of “realities”.
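
To make the idea concrete, here is a toy sketch of randomizing the physics during training. It is not their setup: the point-mass "simulator", the parameter ranges, and the names `sample_physics`, `rollout_cost`, and `train` are all made up for illustration, and plain random search over a single controller gain stands in for the RL algorithm. The only part that carries over is that each candidate policy is scored across many randomly perturbed simulations rather than one carefully tuned one.

```python
import random


def sample_physics(rng):
    """Draw one randomly perturbed 'reality' (ranges are illustrative assumptions)."""
    return {"mass": rng.uniform(0.5, 2.0), "friction": rng.uniform(0.0, 0.5)}


def rollout_cost(gain, physics, steps=200, dt=0.05):
    """Simulate a point mass pushed toward x = 0 and return the accumulated squared error."""
    x, v = 1.0, 0.0
    cost = 0.0
    for _ in range(steps):
        # Hypothetical policy: a proportional-derivative push with one tunable gain.
        force = -gain * x - 0.5 * gain * v
        accel = (force - physics["friction"] * v) / physics["mass"]
        v += accel * dt
        x += v * dt
        cost += x * x * dt
    return cost


def train(num_candidates=50, sims_per_candidate=20, seed=0):
    """Random search stand-in for RL: keep the gain with the best average cost."""
    rng = random.Random(seed)
    best_gain, best_cost = None, float("inf")
    for _ in range(num_candidates):
        gain = rng.uniform(0.1, 10.0)
        # Score the candidate across many perturbed simulations, not a single one.
        costs = [rollout_cost(gain, sample_physics(rng)) for _ in range(sims_per_candidate)]
        avg_cost = sum(costs) / len(costs)
        if avg_cost < best_cost:
            best_gain, best_cost = gain, avg_cost
    return best_gain, best_cost


if __name__ == "__main__":
    gain, cost = train()
    print(f"selected gain: {gain:.2f}, average cost across perturbed sims: {cost:.3f}")
```

Because the score is averaged over sampled masses and frictions, a gain that only works for one specific physics setting loses out to one that holds up across the whole range, which is the property you want when the real world matches none of the simulations exactly.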