-
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transfo...
Semantic segmentation is a fundamental task in computer vision and enables many downstream applications. It is related to image classification since it produces per-pixel... -
ADE20K Dataset
The ADE20K dataset is a large-scale dataset for semantic segmentation. It contains 20,000 images with 150 semantic categories, with 20,000 images for training, 2,000 images for... -
CelebAMask-HQ
CelebAMask-HQ provides the parsing map of images in CelebA-HQ down-sampled to 512 × 512, where pixel-level annotation of 19 classes, including facial components and accessories,... -
PASCAL VOC 2007
Multi-label image recognition is a practical and challenging task compared to single-label image classification. -
Pascal VOC
Semantic segmentation is a crucial and challenging task for image understanding. It aims to predict a dense labeling map for the input image, which assigns each pixel a unique... -
COCO Dataset
The COCO dataset is a large-scale dataset for object detection, semantic segmentation, and captioning. It contains 80 object categories and 1,000 image instances per category,... -
Cityscapes
The Cityscapes dataset is a large and famous city street scene semantic segmentation dataset. 19 classes of which 30 classes of this dataset are considered for training and... -
Microsoft COCO
The Microsoft COCO dataset was used for training and evaluating the CNNs because it has become a standard benchmark for testing algorithms aimed at scene understanding and... -
ImageNet-1k
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used it for language modeling and image classification tasks.