This is previous work. Bolt3D uses the same principle of predicting a per-pixel Gaussian splatting representation, but it also trains a diffusion model, which is only feasible if you have substantial compute available.
Given that this work was done at Google, I don't expect them to release the source code. But it will be reproduced by someone else soon enough.
Apparently you can clone and run the demo locally. But it wasn't clear at a glance how much of it runs locally and what hardware is required.