|
|
|
|
|
by littlestymaar
296 days ago
|
|
> VLMs are hugely significant. Not only are they great for product use cases, giving users the ability to ask questions with images, but they're how we gather the synthetic training data to build image and video animation models. We couldn't do that at scale without VLMs. No human annotator would be up to the task of annotating billions of images and videos at scale and consistently. Weren't Dall-E, Midjourney and Stable diffusion built before VLM became a thing? |
|