Scene Understanding - Groups

ETH3D

ETH3D is a small real-world grayscale dataset with both indoor and outdoor scenes. It contains 27 labeled stereo image pairs for training and 20 stereo pairs for testing.

Dataset
JSON

RIO-10 dataset

The RIO-10 dataset contains 10 real indoor environments with varying lighting conditions, camera positions, and object arrangements.

Dataset
JSON

Places205

Places205 is a dataset of 2.5 million images from 205 categories, with 12,000 images per category.

Dataset
JSON

ImageNet: A Large-Scale Hierarchical Image Database

The ImageNet dataset is a large-scale image database that contains over 14 million images, each labeled with one of 21,841 categories.

Dataset
JSON

Places

The dataset used in the paper is Places, a large dataset of 400k pairs of images from the Places 205 dataset and corresponding spoken audio captions.

Dataset
JSON

Argoverse

The Argoverse dataset is a large-scale dataset for autonomous driving, containing 3D point clouds, semantic segmentation masks, and instance segmentation masks.

Dataset
JSON

Scannet

The dataset used for training and testing the proposed RGBD-based obstacle avoidance system for visually impaired people.

Dataset
JSON

PandaSet

A multi-modal dataset capturing self-driving scenes in various conditions, including different times of day and weather.

Dataset
JSON

Shapenet

Shapenet is a large-scale synthesis 3D object dataset, where we follow [9] to use the official test splits of chair, car, and motorbike categories for evaluation since they...

Dataset
JSON

YFCC100M

The dataset used in the paper is YFCC100M, a large-scale video dataset. The dataset is used for foreground and background patch extraction and object recognition tasks.

Dataset
JSON

SUN RGB-D

RGB-D scene recognition approaches often train two standalone backbones for RGB and depth modalities with the same Places or ImageNet pre-training. However, the pre-trained...

Dataset
JSON

NeRF-Synthetic

A point cloud rendering method that achieves comparable rendering performance to NeRF.

Dataset
JSON

NeRF

NeRF [33] has demonstrated amazing ability to synthesize images of 3D scenes from novel views. However, they rely upon specialized volumetric rendering algorithms based on ray...

Dataset
JSON

LSUN

The dataset used for training and validation of the proposed approach to combine semantic segmentation and dense outlier detection.

Dataset
JSON

Cityscapes

The Cityscapes dataset is a large and famous city street scene semantic segmentation dataset. 19 classes of which 30 classes of this dataset are considered for training and...

Dataset
JSON

KITTI dataset

The dataset used in the paper is the KITTI dataset, which is a benchmark for monocular depth estimation. The dataset consists of a large collection of images and corresponding...

Dataset
JSON

16 datasets found