Natural Image-Text Dataset

This dataset contains the natural image-text pairs used to train the Vary-base model.

BibTeX: