MS COCO dataset

The MS COCO dataset is a large benchmark for image captioning, containing 328K images with 5 caption descriptions each.

Data and Resources

Cite this as

Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Devan Ramanan, Piotr Dollár, C Lawrence Zitnick (2024). Dataset: MS COCO dataset. https://doi.org/10.57702/6k0o15qm

DOI retrieved: November 25, 2024

Additional Info

Field Value
Created November 25, 2024
Last update December 2, 2024
Defined In https://doi.org/10.48550/arXiv.2307.07370
Citation
  • https://doi.org/10.48550/arXiv.1909.00126
  • https://doi.org/10.48550/arXiv.1711.05954
  • https://doi.org/10.48550/arXiv.2007.01496
  • https://doi.org/10.48550/arXiv.2302.08476
Author Tsung-Yi Lin
More Authors
Michael Maire
Serge Belongie
James Hays
Pietro Perona
Devan Ramanan
Piotr Dollár
C Lawrence Zitnick
Homepage https://cocodataset.org/