Conceptual Captions

Conceptual Captions is a large-scale real-world dataset of 3,334,173 images, each paired with a single caption. An estimated 3% to 20% of its image-text pairs are mismatched.
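To make the quoted noise range concrete, the short sketch below converts the 3% to 20% mismatch rate into an approximate number of affected pairs. It assumes the rate applies to the full set of 3,334,173 pairs; the function and constant names are illustrative and do not come from the dataset or its authors.

```python
# Figures quoted in the dataset description.
TOTAL_PAIRS = 3_334_173
MISMATCH_LOW = 0.03   # lower bound of the quoted mismatch rate
MISMATCH_HIGH = 0.20  # upper bound of the quoted mismatch rate


def implied_mismatch_range(total, low, high):
    """Return the (min, max) number of mismatched pairs implied by a rate range.

    Assumption: the rate applies uniformly to all `total` pairs.
    """
    return round(total * low), round(total * high)


low_count, high_count = implied_mismatch_range(
    TOTAL_PAIRS, MISMATCH_LOW, MISMATCH_HIGH
)
print(f"{low_count:,} to {high_count:,} mismatched pairs")
```

Under that assumption, the range works out to roughly 100 thousand to 667 thousand mismatched pairs.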

Cite this as

Piyush Sharma, Nan Ding, Sebastian Goodman, Radu Soricut (2024). Dataset: Conceptual Captions. https://doi.org/10.57702/6qvb3f54

DOI retrieved: November 25, 2024

Additional Info

Field: Value
Created: November 25, 2024
Last update: November 25, 2024
Defined In: https://doi.org/10.48550/arXiv.1909.11059
Citation: https://doi.org/10.48550/arXiv.1908.08530
Version: CC152K
Author: Piyush Sharma
More Authors: Nan Ding, Sebastian Goodman, Radu Soricut