Scene Understanding - Groups

COCOStuff

COCOStuff is a scene-centric dataset with a total of 80 things and 91 stuff categories.

Dataset
JSON

Behave dataset

The Behave dataset contains various scenes with human-object interactions, and is used to evaluate the proposed object-level 3D semantic mapping approach.

Dataset
JSON

COCO-stuff dataset

The COCO-stuff dataset is a large-scale dataset for scene understanding, object detection, and image synthesis.

Dataset
JSON

RefCOCO dataset

The authors used the RefCOCO dataset, a large-scale dataset for object detection and scene understanding, to train and evaluate their models.

Dataset
JSON

ImageNet: A Large-Scale Hierarchical Image Database

The ImageNet dataset is a large-scale image database that contains over 14 million images, each labeled with one of 21,841 categories.

Dataset
JSON

Stanford dataset

The Stanford dataset consists of a large-scale collection of aerial images and videos of a university campus containing various agents (cars, buses, bicycles, golf carts,...

Dataset
JSON

Visual Genome

The Visual Genome dataset is a large-scale visual question answering dataset, containing 1.5 million images, each with 15-30 annotated entities, attributes, and relationships.

Dataset
JSON

Cityscapes

The Cityscapes dataset is a large and famous city street scene semantic segmentation dataset. 19 classes of which 30 classes of this dataset are considered for training and...

Dataset
JSON

KITTI dataset

The dataset used in the paper is the KITTI dataset, which is a benchmark for monocular depth estimation. The dataset consists of a large collection of images and corresponding...

Dataset
JSON

9 datasets found