77 datasets found

Formats: JSON

Filter Results
  • NYU Depth dataset

    The NYU Depth dataset is a large-scale dataset of indoor and outdoor scenes, which is widely used for 3D depth estimation and scene understanding tasks.
  • Super-CLEVR

    The Super-CLEVR dataset contains synthetic scenes of randomly placed vehicles from 5 categories (car, plane, bicycle, motorbike, bus) with various attributes (color, material,...
  • PASCAL-Context and NYUD-v2 datasets

    The PASCAL-Context and NYUD-v2 datasets are used for multi-task learning in dense scene understanding.
  • RIO-10 dataset

    The RIO-10 dataset contains 10 real indoor environments with varying lighting conditions, camera positions, and object arrangements.
  • Places205

    Places205 is a dataset of 2.5 million images from 205 categories, with 12,000 images per category.
  • NSVF-Synthetic

    The NSVF-Synthetic dataset is a synthetic dataset for neural sparse voxel fields.
  • Argoverse 2

    The Argoverse 2 motion forecasting dataset contains 250,000 driving scenarios, each 11 seconds long. These scenarios cover 6 geographical regions and represent 763 total hours...
  • FS-COCO

    FS-COCO: A large-scale scene sketch dataset with fine-grained alignment among sketch, text, and photo.
  • SketchyCOCO

    SketchyCOCO: A large-scale scene sketch dataset with fine-grained alignment among sketch, text, and photo.
  • Scan2CAD

    The Scan2CAD dataset contains object-level human-generated annotations. The annotations include category label, segmentation, a similar CAD model in ShapeNet, and the...
  • CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks

    3D Convolution Neural Networks (CNNs) have been widely applied to 3D scene understanding, such as video analysis and volumetric image recognition.
  • Behave dataset

    The Behave dataset contains various scenes with human-object interactions, and is used to evaluate the proposed object-level 3D semantic mapping approach.
  • UniPerf

    The UniPerf dataset is a benchmark for perceptual parsing for scene understanding.
  • Slot Attention

    A dataset of videos of a robot interacting with blocks of different shapes and colors placed on a table in a simulation environment.
  • MSRC

    The dataset used in the paper is a multi-view clustering dataset, which contains 6 views of 2000 samples each. The dataset is used to evaluate the performance of the proposed...
  • LayoutMP3D

    A dataset for layout annotation of Matterport3D images.
  • Natural Scenes Dataset (NSD)

    The Natural Scenes Dataset (NSD) is a large-scale fMRI dataset used for visual decoding. It features in-depth recordings of brain activities from 8 participants who passively...
  • COCO-stuff dataset

    The COCO-stuff dataset is a large-scale dataset for scene understanding, object detection, and image synthesis.
  • Cityspace

    The dataset used for training and testing the proposed RGBD-based obstacle avoidance system for visually impaired people.
  • LSUN Tower

    LSUN Tower dataset is a subset of the LSUN dataset, with 708,264 images.