Conceptual Captions 12M

doi:doi:10.57702/vsbq74bb

Conceptual Captions 12M

The Conceptual Captions 12M (CC-12M) dataset consists of 12 million diverse and high-quality images paired with descriptive captions and titles.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Liangliang Cao, Bowen Zhang, Chen Chen, Yinfei Yang, Xianzhi Du, Wencong Zhang, Zhiyun Lu, Yantao Zheng (2024). Dataset: Conceptual Captions 12M. https://doi.org/10.57702/vsbq74bb

DOI retrieved: December 3, 2024

Additional Info

Field	Value
Created	December 3, 2024
Last update	December 3, 2024
Defined In	https://doi.org/10.48550/arXiv.2210.09996
Citation	https://doi.org/10.48550/arXiv.2403.02677 https://doi.org/10.48550/arXiv.2305.05095
Author	Liangliang Cao
More Authors	Bowen Zhang Chen Chen Yinfei Yang Xianzhi Du Wencong Zhang Zhiyun Lu Yantao Zheng
Homepage	https://arxiv.org/abs/1807.11538