|
|
|
|
|
by satvikpendem
1280 days ago
|
|
> Is it possible to write caption manually? sure, but that doesn't scale much and won't make it possible to train general models. Maybe, I don't think so however based on the above comments by Unstable Diffusion. It seems like people are underestimating the power of high quality data and just throwing the kitchen sink at models. Perhaps a set of good quality data can indeed outperform Laion-style datasets. It's like the YC saying about doing things that don't scale, perhaps with the high quality dataset, we can train better models than CLIP and in turn use those to caption the rest of the images, only now the caption model is much better than previous ones. |
|