42 datasets found

Tags: Scene Understanding

  • MME

    MME: A comprehensive evaluation benchmark for multimodal large language models
  • Pyramid scene parsing network

    Pyramid scene parsing network for semantic segmentation.
  • SUN2012 Dataset

    The SUN2012 dataset is a challenging dataset for object detection, with large cluttered scenes and small objects.
  • Places

    Places is a large dataset of 400k pairs of images from the Places 205 dataset and corresponding spoken audio captions.
  • ScanNetv2, ScanNet200, and Replica

    ScanNetv2, ScanNet200, and Replica datasets for 3D instance segmentation
  • ScanNet

    The dataset used for training and testing the proposed RGBD-based obstacle avoidance system for visually impaired people.
  • Indoor Segmentation and Support Inference from RGB-D Images

    Indoor segmentation and support inference from RGB-D images.
  • SUN RGB-D dataset

    The SUN RGB-D dataset includes 10,335 RGB-D captures taken with different RGB-D sensors, including Asus Xtion, RealSense, Kinect v1, and Kinect v2. Following...
  • ShapeNet

    ShapeNet is a large-scale synthetic 3D object dataset, where we follow [9] to use the official test splits of chair, car, and motorbike categories for evaluation since they...
  • WRGB-D Scenes Dataset

    A large-scale hierarchical multi-view RGB-D object dataset.
  • BigBird Dataset

    A large-scale 3D database of object instances for scene understanding.
  • Tanks and Temples

    Neural Radiance Fields (NeRFs) model a 3D scene as a volumetric function, which can be rendered from arbitrary viewpoints to generate highly-realistic images.
  • Replica

    The Replica dataset contains 18 highly photo-realistic indoor environments. It provides dense meshes, high-resolution RGB-D images and a large range of instance annotations...
  • Objaverse

    The Objaverse dataset contains around 800k 3D objects. After applying a simple CLIP-based filter [27] to remove the objects whose rendered images are not relevant to its...
  • ONCE Dataset

    A dataset for 3D object detection from LiDAR point clouds, containing 5,000 training frames and 3,000 validation frames.
  • NYUDv2

    The NYUDv2 dataset contains 1,449 labeled indoor-scene RGB images with both parsing annotations and Kinect depths.
  • SUN RGB-D

    RGB-D scene recognition approaches often train two standalone backbones for RGB and depth modalities with the same Places or ImageNet pre-training. However, the pre-trained...
  • S3DIS

    A real-world 3D point cloud dataset used for 3D shape classification, part segmentation, and shape retrieval tasks.
  • LSUN

    The dataset used for training and validation of the proposed approach to combine semantic segmentation and dense outlier detection.
  • Cityscapes

    The Cityscapes dataset is a large, widely used semantic segmentation dataset of city street scenes. 19 of the dataset's 30 classes are considered for training and...
You can also access this registry using the API (see API Docs).