-
PASCAL Visual Object Classes Challenge
The PASCAL Visual Object Classes Challenge (VOC) is a benchmark dataset for object detection and semantic segmentation. -
COCO object detection and instance segmentation, ADE20K semantic segmentation
The datasets used in the paper are the COCO object detection and instance segmentation dataset and the ADE20K semantic segmentation dataset. -
SBD dataset
The SBD dataset is a benchmark dataset for semantic segmentation and object detection. -
COCO 2017 Dataset
The COCO 2017 Dataset is a large-scale benchmark dataset for object detection, semantic segmentation, and instance segmentation. -
CAE v2: Context Autoencoder with CLIP Target
Masked image modeling (MIM) learns visual representation by masking and reconstructing image patches. Applying the reconstruction supervision on the CLIP representation has been... -
CamVid Dataset
The CamVid dataset is a benchmark for semantic segmentation. It consists of 700 images with 11 object classes. -
Pascal VOC 2012
The dataset used in the paper is the Pascal VOC 2012 dataset, which is a benchmark for instance segmentation. The dataset consists of 1464 images with 20 class categories and... -
ImageNet-1K, ADE20K, and COCO 2017
The datasets used in the paper are ImageNet-1K, ADE20K, and COCO 2017. -
COCO Stuff
The COCO Stuff dataset is an extension of the COCO dataset in which 164,000 images covering 171 classes are annotated with segmentation masks. -
ND-MLS dataset
The bottle, grass, cat, and horse datasets were created for semantic segmentation tasks. The datasets contain images of 4 object types. The ND-MLS dataset was evaluated on... -
PASCAL Context
The PASCAL Context dataset is a benchmark for multi-task learning in computer vision. It contains 10103 images with 5 tasks: semantic segmentation, human body part segmentation,... -
PASCAL VOC 2007
Multi-label image recognition is a practical and challenging task compared to single-label image classification. -
ImageNet, MS COCO, and Pascal VOC datasets
The datasets used in the paper are ImageNet, MS COCO, and Pascal VOC. -
Argoverse-HD, Cityscapes, and nuScenes
The datasets used in the paper are Argoverse-HD, Cityscapes, and nuScenes.