Dataset - LDM

NTU-120 minus 60

A large-scale labeled RGB-Depth dataset for action recognition, containing 942 training and 132 validation videos for 51 action classes.
- Dataset
- JSON
PKU-MMD

A large-scale labeled RGB-Depth-Optical Flow dataset for action recognition, containing 1,074 long untrimmed videos with paired RGB, depth, and optical flow modalities for 51...
- Dataset
- JSON
Anti-UAV: A Large Multi-Modal Benchmark for UAV Tracking

A large multi-modal benchmark for UAV tracking, containing high-quality and high-definition video sequences of both RGB and IR, each annotated with bounding boxes, attributes,...
- Dataset
- JSON
MANUS-Grasps

MANUS-Grasps is a large real-world multi-view RGB grasp dataset with over 7M frames from 53 cameras, providing full 360-degree coverage of 400+ grasps in over 30 diverse...
- Dataset
- JSON
SEDS: Semantically Enhanced Dual-Stream Encoder for Sign Language Retrieval

Sign language retrieval is more biased towards understanding the semantic information of human actions contained in video clips. The proposed framework addresses these issues by...
- Dataset
- JSON
Centered-Shoe

A unified object-centric implicit representation that can be used for RGB and depth novel view rendering, 3D reconstruction, and proposing stable grasps.
- Dataset
- JSON
Spike dataset

The dataset used in the paper is a spike dataset generated from RGB frames of four open access outdoor datasets, including Kitti, Driving Stereo, Driving Stereo Weather, and...
- Dataset
- JSON
KITTI dataset

The dataset used in the paper is the KITTI dataset, which is a benchmark for monocular depth estimation. The dataset consists of a large collection of images and corresponding...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

8 datasets found