This is so interesting. Because it can't be this hard to do this kind of physics simulation at the correct level of fidelity if you want to apply RL to physical problems
What does RL have to do with this? The laws governing gravity are already well understood and specialized code will always be more computationally efficient while having less unexplainable behavior. Why would you use a slower, more opaque method to accomplish the same thing?