16 datasets found

Groups: Computer Vision Formats: JSON

  • ETH3D

    ETH3D is a small real-world grayscale dataset with both indoor and outdoor scenes. It contains 27 labeled stereo image pairs for training and 20 stereo pairs for testing.
  • RIO-10 dataset

    The RIO-10 dataset contains 10 real indoor environments with varying lighting conditions, camera positions, and object arrangements.
  • Places205

    Places205 is a dataset of 2.5 million images from 205 categories, with 12,000 images per category.
  • ImageNet: A Large-Scale Hierarchical Image Database

    The ImageNet dataset is a large-scale image database that contains over 14 million images, each labeled with one of 21,841 categories.
  • Places

    Places is a large dataset of 400k pairs of images from the Places205 dataset and their corresponding spoken audio captions.
  • Argoverse

    The Argoverse dataset is a large-scale dataset for autonomous driving, containing 3D point clouds, semantic segmentation masks, and instance segmentation masks.
  • Scannet

    This dataset is used for training and testing the proposed RGB-D-based obstacle avoidance system for visually impaired people.
  • PandaSet

    A multi-modal dataset capturing self-driving scenes in various conditions, including different times of day and weather.
  • Shapenet

    Shapenet is a large-scale synthetic 3D object dataset. We follow [9] in using the official test splits of the chair, car, and motorbike categories for evaluation since they...
  • YFCC100M

    YFCC100M is a large-scale multimedia dataset of Flickr photos and videos. In the paper, it is used for foreground and background patch extraction and for object recognition tasks.
  • SUN RGB-D

    RGB-D scene recognition approaches often train two standalone backbones for RGB and depth modalities with the same Places or ImageNet pre-training. However, the pre-trained...
  • NeRF-Synthetic

    A point cloud rendering method that achieves comparable rendering performance to NeRF.
  • NeRF

    NeRF [33] has demonstrated an amazing ability to synthesize images of 3D scenes from novel views. However, it relies upon specialized volumetric rendering algorithms based on ray...
  • LSUN

    This dataset is used for training and validation of the proposed approach, which combines semantic segmentation and dense outlier detection.
  • Cityscapes

    The Cityscapes dataset is a large, well-known semantic segmentation dataset of city street scenes. Of the dataset's 30 classes, 19 are considered for training and...
  • KITTI dataset

    The dataset used in the paper is the KITTI dataset, which is a benchmark for monocular depth estimation. The dataset consists of a large collection of images and corresponding...