-
Penn Action
The Penn Action dataset is a real video dataset of people performing various indoor and outdoor sports with annotations of human joint locations. -
UTD-MHAD dataset
UTD-MHAD dataset for human action recognition -
AID dataset
The AID dataset is a benchmark for scene classification in remote sensing. It contains aerial images with 30 scene types. -
SkateFormer: Skeletal-Temporal Transformer for Human Action Recognition
Skeleton-based action recognition, which classifies human actions based on the coordinates of joints and their connectivity within skeleton data. -
SBU kinect dataset
The SBU kinect dataset is an interaction dataset acquired using the Microsoft kinect sensor. -
Charades dataset
The Charades dataset is a dataset for human action recognition. It contains 200 videos with 3,800+ action instances. -
Epic-Kitchens-100
The Epic-Kitchens-100 dataset contains 97 verb and 300 noun classes with actions defined by the combination of nouns and verbs. -
Something-Something-V1 and V2
The Something-Something-V1 and V2 dataset contains 174 human action categories with 108K and 220K videos. -
UCF101: A Dataset of 101 Human Actions Classes from Videos in the Wild
The authors used the UCF101, HMDB51, and Diving48 datasets to evaluate the performance of their proposed algorithm. -
Kinetics dataset
The Kinetics dataset is a large-scale action recognition dataset. It contains videos of various actions performed by humans, with annotations of the actions performed. -
UCF-101 dataset
UCF-101 dataset is a large-scale action recognition dataset, containing 13,320 videos categorized into 101 human action categories. -
Pose-Aware Video Transformers
Human perception of surroundings is often guided by the various poses present within the environment. Many computer vision tasks, such as human action recognition and robot... -
Kinetics-600
The Kinetics-600 dataset consists of 392k training videos and 30k validation videos in 600 human action categories.