by qingcharles 385 days ago
Note that it isn't being created from whole cloth. It is trained on videos of the places and then generates the frames:

"To improve autoregressive stability for this research preview, what we’re sharing today can be considered a narrow distribution model: it's pre-trained on video of the world, and post-trained on video from a smaller set of places with dense coverage. The tradeoff of this post-training is that we lose some generality, but gain more stable, long-running autoregressive generation."

https://odyssey.world/introducing-interactive-video

2 comments

I recognized the Santa Cruz Beach Boardwalk channel. It was exactly as I remember it.
Could probably be (semi?)automated to run on 3D models of places that don't exist. Even AI-built 3D models.
The paper they’re basing this on already does this.

https://diamond-wm.github.io/

What's the point? You already have the 3D models. If you want an interactive video, just use the 3D models.