MSCOCO

doi:doi:10.57702/xriudzva

You're currently viewing an old version of this dataset. To see the current version, click here.

MSCOCO

MSCOCO is a benchmark dataset for various computer vision tasks, including object detection, instance segmentation, and image captioning. It contains 83k training images, 40k validation images, and 81k test images, each associated with five captions.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Jun Yu, Jing Li, Zhou Yu, Qingming Huang (2024). Dataset: MSCOCO. https://doi.org/10.57702/xriudzva

DOI retrieved: November 25, 2024

Additional Info

Field	Value
Created	November 25, 2024
Last update	November 25, 2024
Defined In	https://doi.org/10.48550/arXiv.1905.07841
Citation	https://doi.org/10.18653/v1/D19-5627
Author	Jun Yu
More Authors	Jing Li Zhou Yu Qingming Huang
Homepage	https://cocodataset.org/