COCO-Stuff, Pascal-VOC, and Pascal-Context

The paper uses three datasets: COCO-Stuff, Pascal-VOC, and Pascal-Context. COCO-Stuff is a large-scale semantic segmentation dataset with 171 categories (80 thing classes and 91 stuff classes). It contains 117k training images and 5k validation images, and its classes are divided into 156 seen classes and 15 unseen classes. Pascal-VOC consists of 11,185 training images and 1,449 validation images across 20 classes. Pascal-Context provides supplementary annotations for Pascal-VOC 2010 and consists of 4,998 training images and 5,005 validation images.
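The figures in the description can be recorded in a small lookup table and sanity-checked. The following is a minimal sketch only: `DatasetStats`, `DATASETS`, and `split_is_consistent` are hypothetical names introduced for illustration and are not part of the Cascade-CLIP repository. The COCO-Stuff image counts are the rounded values quoted above (117k and 5k), and fields the description does not state are left as `None`.

```python
from dataclasses import dataclass
from typing import Optional

# Class breakdown of COCO-Stuff as quoted in the description.
COCO_THING_CLASSES = 80
COCO_STUFF_CLASSES = 91


@dataclass(frozen=True)
class DatasetStats:
    """Hypothetical record of the statistics quoted in the description."""
    name: str
    train_images: int
    val_images: int
    num_classes: Optional[int] = None     # None: not stated in the description
    seen_classes: Optional[int] = None
    unseen_classes: Optional[int] = None

    def split_is_consistent(self) -> bool:
        """Check seen + unseen == total; returns True when any value is unknown."""
        values = (self.num_classes, self.seen_classes, self.unseen_classes)
        if any(v is None for v in values):
            return True
        return self.seen_classes + self.unseen_classes == self.num_classes


DATASETS = {
    # Image counts for COCO-Stuff are rounded ("117k", "5k") in the description.
    "COCO-Stuff": DatasetStats("COCO-Stuff", 117_000, 5_000,
                               num_classes=171, seen_classes=156,
                               unseen_classes=15),
    "Pascal-VOC": DatasetStats("Pascal-VOC", 11_185, 1_449, num_classes=20),
    "Pascal-Context": DatasetStats("Pascal-Context", 4_998, 5_005),
}
```

A quick check such as `all(d.split_is_consistent() for d in DATASETS.values())` confirms that the quoted seen/unseen split adds up to the stated class total.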

Cite this as

Yunheng Li, Zhong-Yu Li, Quansheng Zeng, Qibin Hou, Ming-Ming Cheng (2024). Dataset: COCO-Stuff, Pascal-VOC, and Pascal-Context. https://doi.org/10.57702/b55l74tf

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Defined In https://doi.org/10.48550/arXiv.2406.00670
Author Yunheng Li
More Authors Zhong-Yu Li, Quansheng Zeng, Qibin Hou, Ming-Ming Cheng
Homepage https://github.com/HVision-NKU/Cascade-CLIP