You're currently viewing an old version of this dataset. To see the current version, click here.

COCO Caption dataset

The COCO Caption dataset, containing 330,000 images with five independent human-generated captions each.

Data and Resources

Cite this as

Xinlei Chen, Hao Fang, Tsung-Yi Lin, Ramakrishna Vedantam, Saurabh Gupta, Piotr Dollár, C Lawrence Zitnick (2025). Dataset: COCO Caption dataset. https://doi.org/10.57702/n2kaqges

DOI retrieved: January 3, 2025

Additional Info

Field Value
Created January 3, 2025
Last update January 3, 2025
Defined In https://doi.org/10.48550/arXiv.2305.02297
Author Xinlei Chen
More Authors
Hao Fang
Tsung-Yi Lin
Ramakrishna Vedantam
Saurabh Gupta
Piotr Dollár
C Lawrence Zitnick
Homepage https://cocodataset.org/