by jasonjmcghee 846 days ago
> Genie is capable of converting a variety of different prompts into interactive, playable environments that can be easily created, stepped into, and explored

If these are generating fully interactive environments, why are all the clips ~1 second long?

Based on the first sentence in your paper, I would have expected a playable example as a demo. Or 20.

But reading a bit further into the paper, it sounds like the model needs to be actively running inference and will generate the next frame on the fly as actions are taken. Is that correct?

1 comment

That is correct! The model generates each frame on the fly.
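
For anyone wondering what "on the fly" implies for a demo, here is a rough sketch of that kind of loop. It is not Genie's actual code: `toy_next_frame` is a hypothetical stand-in for the model, and the frame and action types are invented for illustration. The point is only that each new frame needs one inference call that takes the frames so far plus the player's latest action.

```python
from typing import Callable, List, Sequence

Frame = List[float]  # stand-in for an image; a real frame would be a pixel array


def toy_next_frame(history: Sequence[Frame], action: int) -> Frame:
    """Hypothetical stand-in for the model: offsets the last frame by the action."""
    return [pixel + action for pixel in history[-1]]


def play(
    prompt_frame: Frame,
    actions: Sequence[int],
    next_frame: Callable[[Sequence[Frame], int], Frame] = toy_next_frame,
) -> List[Frame]:
    """Generate one new frame per action, conditioning on every frame so far."""
    frames = [prompt_frame]
    for action in actions:
        # One inference call per step: nothing is rendered ahead of time.
        frames.append(next_frame(frames, action))
    return frames


frames = play([0.0, 0.0], actions=[1, 1, -1])
print(len(frames))  # 4: the prompt frame plus one generated frame per action
```

With a real model in place of `toy_next_frame`, the cost of each call is what sets the frame rate, which is presumably why a hosted playable demo is harder to ship than short clips.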