You're currently viewing an old version of this dataset. To see the current version, click here.

MSCOCO

MSCOCO is a benchmark dataset for various computer vision tasks, including object detection, instance segmentation, and image captioning. It contains 83k training images, 40k validation images, and 81k test images, each associated with five captions.

Data and Resources

Cite this as

Jun Yu, Jing Li, Zhou Yu, Qingming Huang (2024). Dataset: MSCOCO. https://doi.org/10.57702/xriudzva

DOI retrieved: November 25, 2024

Additional Info

Field Value
Created November 25, 2024
Last update November 25, 2024
Defined In https://doi.org/10.48550/arXiv.1905.07841
Citation
  • https://doi.org/10.18653/v1/D19-5627
Author Jun Yu
More Authors
Jing Li
Zhou Yu
Qingming Huang
Homepage https://cocodataset.org/