Image Segmentation - Groups

Vaihingen dataset

The Vaihingen dataset consists of 1440 scenes with a size of 250×250 pixels. Each scene is a colour-infrared (CIR) true orthophoto and a height grid (digital surface model; DSM)...

Dataset
JSON

Synthia

The Synthia dataset is a large-scale urban scene understanding dataset, containing 9000 samples. It is used for semantic segmentation tasks.

Dataset
JSON

PASCAL-5i

Few-shot segmentation remains challenging due to the limitations of its labeling information for unseen classes. Most previous approaches rely on extracting high-level feature...

Dataset
JSON

GTA

The GTA dataset consists of 24966 synthetic images synthetically generated from a video game consisting of outdoor scenes with rich variety of variations in lighting and trafﬁc...

Dataset
JSON

NYUD-v2

The NYUD-v2 dataset is a benchmark for indoor scene segmentation and depth estimation. It contains 1449 images with 4 tasks: semantic segmentation, depth estimation, surface...

Dataset
JSON

Synthia→Cityscapes

The Synthia→Cityscapes task is a domain adaptation task for semantic segmentation, where the source domain is Synthia and the target domain is Cityscapes.

Dataset
JSON

GTA5 and SYNTHIA

The dataset used in the paper is GTA5 and SYNTHIA, which are used for domain adaptive semantic segmentation (DASS).

Dataset
JSON

ADE20K Dataset

The ADE20K dataset is a large-scale dataset for semantic segmentation. It contains 20,000 images with 150 semantic categories, with 20,000 images for training, 2,000 images for...

Dataset
JSON

CamVid Dataset

CamVid dataset is a benchmark dataset for semantic segmentation. It consists of 700 images with 11 object classes.

Dataset
JSON

Pascal VOC 2012

The dataset used in the paper is the Pascal VOC 2012 dataset, which is a benchmark for instance segmentation. The dataset consists of 1464 images with 20 class categories and...

Dataset
JSON

COCO Stuff

COCO Stuff dataset is an extension of the COCO dataset, 164,000 images covering 171 classes are annotated with segmentation masks.

Dataset
JSON

Pyramid scene parsing network

Pyramid scene parsing network for semantic segmentation.

Dataset
JSON

PASCAL Context

The PASCAL Context dataset is a benchmark for multi-task learning in computer vision. It contains 10103 images with 5 tasks: semantic segmentation, human body part segmentation,...

Dataset
JSON

CamVid

The dataset used in the paper is a pre-trained ResNet-50 classiﬁer, which is used for image synthesis, unpaired image-to-image translation, and feature similarity estimation.

Dataset
JSON

CelebAMask-HQ

CelebAMask-HQ provides the parsing map of images in CelebA-HQ down-sampled to 512 × 512, where pixel-level annotation of 19 classes, including facial components and accessories,...

Dataset
JSON

SUN-RGBD

The dataset is used for indoor scene understanding and contains RGB and depth images.

Dataset
JSON

PASCAL VOC 2007

Multi-label image recognition is a practical and challenging task compared to single-label image classification.

Dataset
JSON

NYUv2

Multi-task learning (MTL) research is broadly divided into two categories: one is to learn the correlation between tasks through model structures, and the other is to balance...

Dataset
JSON

Syntagen - Harnessing Generative Models for Synthetic Visual Datasets

The dataset is generated using a latent diffusion model, specifically Stable Diffusion 2.1, and is used for semantic segmentation tasks.

Dataset
JSON

LoveDA

The LoveDA dataset contains high-spatial-resolution images from three different cities, focusing on improving the generalization capability of model from different urban and...

Dataset
JSON

29 datasets found