|
|
|
|
|
by tmjdev
1357 days ago
|
|
This looks like the video equivalent of Dall-E 1. Hard to believe how far we've come so quickly. The paper talks about "pseudo 3D attention layers" that are used in place of temporal attention layers for each dimension due to memory consumption. It seems like AI research is vastly outpacing GPU development. |
|
Even then, these videos are only like 50 frames long - and a real movie you would want to be hundreds of thousands of frames long.