|
|
|
|
|
by supermatt
698 days ago
|
|
I havent trained any LLMs, so please accept my comment with all the naivety with which it is given - but in the "examples of MINT multimodal documents" graphic at the top of the README, it feels to me as though the labeling (for the images on the left) couldn't be much worse? Is this normal for these datasets? How are we able to build such powerful models with such poor quality data? |
|
Not to say that data quality does not matter, but these noisy sets are still very useful.
[1] https://arxiv.org/abs/1705.10694