https://deepmind.google/discover/blog/genie-2-a-large-scale-...
this one's mostly about world understanding (physical concepts etc.) and less about photorealism, but it's really not hard to imagine a pipeline combining the two