-
Vaihingen dataset
The Vaihingen dataset consists of 1440 scenes with a size of 250×250 pixels. Each scene is a colour-infrared (CIR) true orthophoto and a height grid (digital surface model; DSM)... -
Synthia→Cityscapes
The Synthia→Cityscapes task is a domain adaptation task for semantic segmentation, where the source domain is Synthia and the target domain is Cityscapes. -
GTA5 and SYNTHIA
The dataset used in the paper is GTA5 and SYNTHIA, which are used for domain adaptive semantic segmentation (DASS). -
ADE20K Dataset
The ADE20K dataset is a large-scale dataset for semantic segmentation. It contains 20,000 images with 150 semantic categories, with 20,000 images for training, 2,000 images for... -
CamVid Dataset
CamVid dataset is a benchmark dataset for semantic segmentation. It consists of 700 images with 11 object classes. -
Pascal VOC 2012
The dataset used in the paper is the Pascal VOC 2012 dataset, which is a benchmark for instance segmentation. The dataset consists of 1464 images with 20 class categories and... -
COCO Stuff
COCO Stuff dataset is an extension of the COCO dataset, 164,000 images covering 171 classes are annotated with segmentation masks. -
Pyramid scene parsing network
Pyramid scene parsing network for semantic segmentation. -
PASCAL Context
The PASCAL Context dataset is a benchmark for multi-task learning in computer vision. It contains 10103 images with 5 tasks: semantic segmentation, human body part segmentation,... -
CelebAMask-HQ
CelebAMask-HQ provides the parsing map of images in CelebA-HQ down-sampled to 512 × 512, where pixel-level annotation of 19 classes, including facial components and accessories,... -
PASCAL VOC 2007
Multi-label image recognition is a practical and challenging task compared to single-label image classification. -
Syntagen - Harnessing Generative Models for Synthetic Visual Datasets
The dataset is generated using a latent diffusion model, specifically Stable Diffusion 2.1, and is used for semantic segmentation tasks.