260 datasets found

Groups: Image Segmentation
Formats: JSON

  • PhraseCut

    The PhraseCut dataset contains images annotated with referring expressions, focusing on phrases with a single entity and relation.
  • RefCOCO, RefCOCO+, and RefCOCOg

    Visual grounding is a task that aims to locate a target object according to a natural language expression. The datasets used in this paper are RefCOCO, RefCOCO+, and RefCOCOg.
  • NYUDv2

    The NYUDv2 dataset contains 1,449 labeled indoor-scene RGB images with both parsing annotations and Kinect depths.
  • SUN RGB-D

    RGB-D scene recognition approaches often train two standalone backbones for RGB and depth modalities with the same Places or ImageNet pre-training. However, the pre-trained...
  • Probabilistic Atlas

    The probabilistic atlas dataset includes MRI scans of 20 subjects.
  • T1-weighted 3D brain MRI scans

    This dataset was used for the proposed Segmentation Auto-Encoder (SAE) method for adaptive image segmentation.
  • COCO Dataset

    The COCO dataset is a large-scale dataset for object detection, semantic segmentation, and captioning. It contains 80 object categories and 1,000 image instances per category,...
  • FSS-1000

    The FSS-1000 dataset contains 1,000 object categories with 10 annotated images each, for 10,000 images in total.
  • Visual Genome

    The Visual Genome dataset is a large-scale dataset used for visual question answering, containing about 108,000 images, each with 15-30 annotated entities, attributes, and relationships.
  • ADE20k

    Semantic segmentation is one of the fundamental problems in computer vision, whose task is to assign a semantic label to each pixel of an image so that different classes can...
  • GTA5→Cityscapes

    The GTA5→Cityscapes dataset is a synthetic-to-real benchmark dataset for domain adaptation in semantic segmentation.
  • Sewer Pipe Cracks Detection

    The Sewer Pipe Cracks dataset contains images of sewer pipes with and without cracks.
  • Surface Defect Saliency of Magnetic Tile

    The Magnetic Tile Defect Datasets contain images of magnetic tiles with and without defects.
  • MS-COCO

    Large-scale datasets [18, 17, 27, 6] boosted text-conditional image generation quality. However, in some domains it could be difficult to make such datasets and usually it could...
  • LSUN

    This dataset was used for training and validation of the proposed approach to combine semantic segmentation and dense outlier detection.
  • CLIP

    The CLIP model and its variants are becoming the de facto backbone in many applications. However, training a CLIP model from hundreds of millions of image-text pairs can be...
  • LVIS

    Instance segmentation (IS) is an important computer vision task, aiming at simultaneously predicting the class label and the binary mask for each instance of interest in an image.
  • Cityscapes

    The Cityscapes dataset is a large, well-known semantic segmentation dataset of city street scenes. 19 of this dataset's 30 classes are considered for training and...
  • Microsoft COCO

    The Microsoft COCO dataset was used for training and evaluating the CNNs because it has become a standard benchmark for testing algorithms aimed at scene understanding and...
  • COCO

    Large-scale datasets [18, 17, 27, 6] boosted text-conditional image generation quality. However, in some domains it could be difficult to make such datasets and usually it could...