13 datasets found

Tags: image classification

  • SegFormer: Simple and Efficient Design for Semantic Segmentation with Transfo...

    Semantic segmentation is a fundamental task in computer vision and enables many downstream applications. It is related to image classification since it produces per-pixel...
  • ADE20K Dataset

    The ADE20K dataset is a large-scale dataset for semantic segmentation. It contains 20,000 images with 150 semantic categories, with 20,000 images for training, 2,000 images for...
  • CelebAMask-HQ

    CelebAMask-HQ provides the parsing map of images in CelebA-HQ down-sampled to 512 × 512, where pixel-level annotation of 19 classes, including facial components and accessories,...
  • PASCAL VOC 2007

    Multi-label image recognition is a practical and challenging task compared to single-label image classification.
  • Pascal VOC

    Semantic segmentation is a crucial and challenging task for image understanding. It aims to predict a dense labeling map for the input image, which assigns each pixel a unique...
  • TrashNet

    The TrashNet dataset contains images of waste, focusing on recycling waste classification.
  • COCO Dataset

    The COCO dataset is a large-scale dataset for object detection, semantic segmentation, and captioning. It contains 80 object categories and 1,000 image instances per category,...
  • MS-COCO

    Large-scale datasets [18, 17, 27, 6] have boosted the quality of text-conditional image generation. However, in some domains it could be difficult to make such datasets and usually it could...
  • Cityscapes

    The Cityscapes dataset is a large, well-known semantic segmentation dataset of city street scenes. Of the dataset's 30 classes, 19 are considered for training and...
  • Microsoft COCO

    The Microsoft COCO dataset was used for training and evaluating the CNNs because it has become a standard benchmark for testing algorithms aimed at scene understanding and...
  • ImageNet-1k

    The paper does not explicitly describe the dataset, but it mentions that the authors used it for language modeling and image classification tasks.
  • COCO

    Large-scale datasets [18, 17, 27, 6] have boosted the quality of text-conditional image generation. However, in some domains it could be difficult to make such datasets and usually it could...
  • MSCOCO

    Human Pose Estimation (HPE) aims to estimate the position of each joint point of the human body in a given image. HPE tasks support a wide range of downstream tasks such as...