Hacker News new | ask | show | jobs
by sharemywin 1727 days ago
Not sure if this is the same thing?

https://github.com/openai/CLIP

1 comments

Not the same. CLIP is trained with pairs of images and texts, whereas VideoCLIP uses pairs of videos and texts.