Hacker News new | ask | show | jobs
by brookst 1208 days ago
Look at the “dataset” column: CLIP was trained on 400m images, UForm on 4m.
1 comments

There are also dataset sizes for Albef and ViCHA.