1,082 datasets found

Groups: Computer Vision Organizations: No Organization

Filter Results
  • NeRF

    NeRF [33] has demonstrated amazing ability to synthesize images of 3D scenes from novel views. However, they rely upon specialized volumetric rendering algorithms based on ray...
  • DomainNet

    The dataset used in the paper is a cross-domain dataset, consisting of six domains: Real, Painting, Sketch, Clipart, Infograph, and Quickdraw. Each domain contains 345 object...
  • SCUT-HEAD Dataset

    The SCUT-HEAD dataset is a head detection dataset containing images with varying scales and poses.
  • Visual Wake Words Dataset

    The Visual Wake Words dataset is a binary classification dataset for detecting the presence of a person in an image.
  • ImageNet-10 Dataset

    The ImageNet-10 dataset is a subset of the ImageNet-1K dataset, containing images from 10 classes.
  • WIDER FACE Dataset

    The WIDER FACE dataset is a face detection dataset containing images with varying scales, poses, and occlusions.
  • Vision-based Target Pose Estimation with Multiple Markers for the Perching of...

    A vision-based target pose estimation method using multiple markers for high-precision nano drone perching at both wide and close ranges.
  • MS-COCO

    Large scale datasets [18, 17, 27, 6] boosted text conditional image generation quality. However, in some domains it could be difficult to make such datasets and usually it could...
  • dsprites: Disentanglement testing sprites dataset

    dsprites: Disentanglement testing sprites dataset
  • MegaDepth

    Feature matching is a fundamental problem for many computer vision tasks, such as object recognition, structure from motion, and simultaneous localization and mapping.
  • DIV2K

    Single Image Super-Resolution (SR) aims to generate a High Resolution (HR) image I SR from a low resolution (LR) im-age I LR such that it is similar to original HR image I HR....
  • LSUN

    The dataset used for training and validation of the proposed approach to combine semantic segmentation and dense outlier detection.
  • CLIP

    The CLIP model and its variants are becoming the de facto backbone in many applications. However, training a CLIP model from hundreds of millions of image-text pairs can be...
  • DDAD dataset

    The DDAD dataset is a new autonomous driving benchmark from Toyota Research Institute for long-range (up to 250m).
  • MSDC-Net: Multi-Scale Dense and Contextual Networks for Automated Disparity M...

    Disparity prediction from stereo images is essential to computer vision applications including autonomous driving, 3D model reconstruction, and object detection.
  • Cityscapes

    The Cityscapes dataset is a large and famous city street scene semantic segmentation dataset. 19 classes of which 30 classes of this dataset are considered for training and...
  • KITTI dataset

    The dataset used in the paper is the KITTI dataset, which is a benchmark for monocular depth estimation. The dataset consists of a large collection of images and corresponding...
  • ShapeNetCore

    The ShapeNetCore dataset is a large-scale 3D model dataset, containing 44,000 3D models and 13 categories.
  • CIFAR-10, CIFAR-100, and ImageNet

    The dataset used in the paper is not explicitly described, but it is mentioned that the authors used CIFAR-10, CIFAR-100, and ImageNet datasets.
  • Bollywood dataset

    The Bollywood dataset is a collection of images of Bollywood celebrities with varying body mass indexes (BMIs). The dataset is used for face-to-BMI prediction.
You can also access this registry using the API (see API Docs).