|
|
|
|
|
by modeless
859 days ago
|
|
> Other interactions, like eating food, do not always yield correct changes in object state So this is why they haven't shown Will Smith eating spaghetti. > These capabilities suggest that continued scaling of video models is a promising path towards the development of highly-capable simulators of the physical and digital world This is exciting for robotics. But an even closer application would be filling holes in gaussian splatting scenes. If you want to make a 3D walkthrough of a space you need to take hundreds to thousands of photos with seamless coverage of every possible angle, and you're still guaranteed to miss some. Seems like a model this capable could easily produce plausible reconstructions of hidden corners or close up detail or other things that would just be holes or blurry parts in a standard reconstruction. You might only need five or ten regular photos of a place to get a completely seamless and realistic 3D scene that you could explore from any angle. You could also do things like subtract people or other unwanted objects from the scene. Such an extrapolated reconstruction might not be completely faithful to reality in every detail, but I think this could enable lots of applications regardless. |
|