Hacker News new | ask | show | jobs
by llSourcell 345 days ago
no its not lower than text, its higher ROI than text for understanding the physics of the world, which is exactly what videos are better at than text when it comes to training data
1 comments

Does that transfer, though? I'm not sure we can expect its ability to approximate physics in video form would transfer to any other mode (text, code, problem solving etc)
depends on the hyperparams but one of the biggest benefits of a latent space is transfer between modalities