- MultiTHUMOS
Temporal action localization (TAL) is a prevailing task due to its great application potential. Existing works in this field mainly suffer from two weaknesses: (1) They often...
- Something-Something V1
Video classification is a fundamental problem in many video-based tasks. Applications such as autonomous driving and the control of drones and robots are driving the demand...
- Mini-Kinetics-200
A dataset of 200 human action classes from videos in the wild.
- H36M-Original
The H36M-Original dataset consists of 6 action classes, with 16 frames per class.
- NTU-Original
The NTU-Original dataset consists of 49 single-person action classes from the real world.
- Kinetics-600
The Kinetics-600 dataset consists of 392k training videos and 30k validation videos in 600 human action categories.
- HumanEva-I
The HumanEva-I dataset is a dataset for 3D human pose estimation, containing video sequences of four subjects performing six common actions.
- Kinetics-400
Motion has been shown to be useful for video understanding, where motion is typically represented by optical flow. However, computing flow from video frames is very time-consuming....
- Something-Something V1 & V2
The Something-Something V1 & V2 dataset is a large-scale video dataset created by crowdsourcing. It contains about 100k videos over 174 categories, and the number of videos...
- UIUC-Sports dataset
- Florence3D-Action
The dataset contains 3D human skeletons and corresponding action labels.
- UTKinect-Action
The dataset contains 3D human skeletons and corresponding action labels.