Natural Image-Text Dataset

This dataset contains the natural image-text pairs used to train the Vary-base model.

Cite this as

Haoran Wei, Lingyu Kong, Jinyue Chen, Liang Zhao, Zheng Ge, Jinrong Yang, Jianjian Sun, Chunrui Han, Xiangyu Zhang (2024). Dataset: Natural Image-Text Dataset. https://doi.org/10.57702/4za9ljzu

DOI retrieved: December 2, 2024

Additional Info

Field          Value
Created        December 2, 2024
Last update    December 2, 2024
Defined In     https://doi.org/10.48550/arXiv.2312.06109
Author         Haoran Wei
More Authors   Lingyu Kong, Jinyue Chen, Liang Zhao, Zheng Ge, Jinrong Yang, Jianjian Sun, Chunrui Han, Xiangyu Zhang