by londons_explore
1357 days ago
Indeed, it's not hard from a research point of view. It's hard from a compute perspective, because adding one more dimension (time) requires hundreds of times more compute. Even then, these videos are only about 50 frames long, whereas a real movie would need to be hundreds of thousands of frames long.
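
To put rough numbers on that gap, here is a back-of-the-envelope sketch. The 24 fps frame rate and two-hour runtime are my own assumptions, not figures from the thread; only the 50-frame clip length comes from the comment above.

```python
# Rough frame-count comparison between a generated clip and a feature film.
# FPS and RUNTIME_MINUTES are assumed values for illustration only.
FPS = 24                # assumed cinema frame rate
RUNTIME_MINUTES = 120   # assumed feature-film length
CLIP_FRAMES = 50        # clip length mentioned in the comment

movie_frames = FPS * RUNTIME_MINUTES * 60   # total frames in the film
ratio = movie_frames / CLIP_FRAMES          # how many 50-frame clips that is

print(f"movie frames: {movie_frames}")
print(f"clips needed: {ratio:.0f}")
```

Under those assumptions a film is a few thousand clips' worth of frames, and that is before accounting for keeping them consistent with each other.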
We can’t do it. AIs can sort of do it.
Latent diffusion models already demonstrated that operating on a compressed representation gives far better results, faster, but I don’t think we’re anywhere near the limit for what’s possible there. It’s no coincidence that this is how humans work.
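
For a sense of how much a compressed representation shrinks the problem, here is a small sketch that counts tensor elements. The sizes are assumptions chosen to resemble a common latent diffusion setup (a 512x512 RGB image, 8x spatial downsampling, 4 latent channels); nothing in this thread specifies them.

```python
def num_elements(height: int, width: int, channels: int) -> int:
    """Number of values in a height x width x channels tensor."""
    return height * width * channels

# Assumed sizes for illustration only.
IMAGE_SIZE = 512        # pixels per side
DOWNSAMPLE = 8          # spatial downsampling factor of the autoencoder
LATENT_CHANNELS = 4     # channels in the latent representation

pixel_elems = num_elements(IMAGE_SIZE, IMAGE_SIZE, 3)
latent_elems = num_elements(IMAGE_SIZE // DOWNSAMPLE,
                            IMAGE_SIZE // DOWNSAMPLE,
                            LATENT_CHANNELS)
compression = pixel_elems / latent_elems

print(f"pixel space:  {pixel_elems} values")
print(f"latent space: {latent_elems} values")
print(f"reduction:    {compression:.0f}x")
```

With these numbers the denoiser works on roughly 48x fewer values per image, which is the kind of saving the comment is pointing at; a more aggressive or temporally aware encoder would push that further.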