Hacker News new | ask | show | jobs
by glcheetham 1399 days ago
The AI is moving up the "content ladder" generating text -> now images -> next videos

Smart for Google to invest in this because their business relies on third party content to exist (blogs, webpages, youtube videos)

If they can vertically integrate their business to create the content AND own the discovery algorithms then they officially win the internet

1 comments

Generating videos and 3D models is _much_ more difficult than images. You can’t just train off videos from the internet in the same way, because they don’t have sufficient text labels to understand them like CLIP does.
Oh but they have sound which can be annotated much faster/more efficiently. You also potentially have screenplay but the amount of training data is probably too less and sparse.

FWIW, I don’t think the AI systems will generate a whole video by itself - it’ll be some form of image to image generation where an artist will render a rough sketch of the scene and the AI will fill in the details, frame by frame.

I mean at this point it’s really not a question of if, but when - right?
I wonder if subtitles could be used, so rather than describing the video, you just write a script and it generates video for you. I'm certainly no expert, but it does seem like there's a lot more data there.