Dataset - LDM

MultiTHUMOS

Temporal action localization (TAL) is a prevailing task due to its great application potential. Existing works in this field mainly suffer from two weaknesses: (1) They often...
- Dataset
- JSON
Something-Something V1

Video classification is a fundamental problem in many video-based tasks. Applications such as autonomous driving and the control of drones and robots are driving the demand...
- Dataset
- JSON
Mini-Kinetics-200

Mini-Kinetics-200: A dataset of 200 human action classes from videos in the wild.
- Dataset
- JSON
SSv2-Full

Action recognition is a fundamental topic in video understanding and has made significant progress in recent years.
- Dataset
- JSON
J-HMDB

A video dataset for action recognition, consisting of 920 videos of 21 different actions.
- Dataset
- JSON
H36M-4A

The H36M-4A dataset is derived from the H36M-Original dataset, using the 4A pipeline to generate synthetic data.
- Dataset
- JSON
H36M-Original

The H36M-Original dataset consists of 6 action classes, with 16 frames per class.
- Dataset
- JSON
NTU-4A

The NTU-4A dataset is derived from the NTU-Original dataset, using the 4A pipeline to generate synthetic data.
- Dataset
- JSON
NTU-Original

The NTU-Original dataset consists of 49 single-person action classes from the real world.
- Dataset
- JSON
Kinetics-600

The Kinetics-600 dataset consists of 392k training videos and 30k validation videos in 600 human action categories.
- Dataset
- JSON
HumanEva-I

HumanEva-I is a dataset for 3D human pose estimation, containing video sequences of four subjects performing six common actions.
- Dataset
- JSON
FineGym

FineGym is a hierarchical video dataset for fine-grained action understanding, containing 354 action categories.
- Dataset
- JSON
Resound

Resound is a video dataset for action recognition without representation bias.
- Dataset
- JSON
Kinetics-400

Motion has been shown to be useful for video understanding, where it is typically represented by optical flow. However, computing flow from video frames is very time-consuming....
- Dataset
- JSON
Something-Something V1 & V2

The Something-Something V1 & V2 dataset is a large-scale video dataset created by crowdsourcing. It contains about 100k videos over 174 categories, and the number of videos...
- Dataset
- JSON
UIUC-Sports dataset

UIUC-Sports dataset
- Dataset
- JSON
UCF101

The UCF101 dataset contains 13320 videos distributed in 101 action categories. This dataset is different from the above ones in that it contains mostly coarse sports activities...
- Dataset
- JSON
HMDB51

Video classification is a fundamental problem in many video-based tasks. Applications such as autonomous driving and the control of drones and robots are driving the demand...
- Dataset
- JSON
Florence3D-Action

The dataset contains 3D human skeletons and corresponding action labels.
- Dataset
- JSON
UTKinect-Action

The dataset contains 3D human skeletons and corresponding action labels.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

69 datasets found