Hacker News new | ask | show | jobs
by baldgeek 879 days ago
2 clicks from the Posted Link: "Read Paper", then "Code, Data and Media" tab will get you the dataset used (https://paperswithcode.com/dataset/ucf101)
2 comments

That's not the dataset used for training. From the paper:

>We train our T2V model on a dataset containing 30M videos along with their text caption. [...] We evaluate our model on a collection of 113 text prompts describing diverse objects and scenes. The prompt list consists of 18 prompts assembled by us and 95 prompts used by prior works (Singer et al., 2022; Ho et al., 2022a; Blattmann et al., 2023b) (see App. B). Additionally, we employ a zero-shot evaluation protocol on the UCF101 dataset >

Well in the Ai/ML era maybe “models or gtfo” is better. Training data is just common crawl for half these LMs.