ZeroVL dataset

This dataset was used to train the ZeroVL model and consists of 14.23M image-text pairs from various domains.

Cite this as

Quan Cui, Boyan Zhou, Yu Guo, Weidong Yin, Hao Wu, Osamu Yoshie, Yubo Chen (2024). Dataset: ZeroVL dataset. https://doi.org/10.57702/qyyog76k

DOI retrieved: December 16, 2024

Additional Info

Field: Value
Created: December 16, 2024
Last update: December 16, 2024
Defined In: https://doi.org/10.48550/arXiv.2112.09331
Author: Quan Cui
More Authors: Boyan Zhou, Yu Guo, Weidong Yin, Hao Wu, Osamu Yoshie, Yubo Chen
Homepage: https://github.com/zerovl/ZeroVL