by zuminator
563 days ago
That was also true of quite a lot of early CGI, but most people would say that things have improved. I think we're on the cusp of rapid improvement in AI video as well, in part spurred on by skilled people using the tools we currently have. I came across the following recently. I think a casual viewer would assume it was just Bakshi-style rotoscoped animation without a major AI component. https://www.youtube.com/watch?v=X9BG6yBkOIE
There's very much no throughline of concepts from one shot to the next. You never see the same character twice. There's no dynamic foreground action, not even simple walking, except for one far-away character walking directly away from the camera, which means their silhouette hardly changes.
This all comes from the current generation of video diffusion models, which basically just generate an image like they always have, except that with a hint of temporal coherence they expand it into a short shot containing no types of movement except those seen a million times in their training set.
Getting generative models to reason better about motion, and to build mental world models of the 3D scene they are managing a 2D window into, is going to be a big challenge. It will require some additional breakthroughs on a par with the original GPT and Stable Diffusion breakthroughs that currently act as the foundation for the majority of modern AI innovation.