This is just an off-the-shelf img2depth model run on top of Stable Diffusion - I don’t think there’s a novel model or research behind this. People have been doing the same thing in Colab for a while.
I don't mean this one method in particular, just the ability to run automatic conversions to 3D or generate 3D assets without needing to hire modelers.