-
Penn Action
The Penn Action dataset is a real video dataset of people performing various indoor and outdoor sports with annotations of human joint locations. -
UTD-MHAD dataset
UTD-MHAD is a dataset for human action recognition. -
AID dataset
The AID dataset is a benchmark for scene classification in remote sensing. It contains aerial images covering 30 scene types. -
SBU Kinect dataset
The SBU Kinect dataset is an interaction dataset acquired using the Microsoft Kinect sensor. -
Charades dataset
Charades is a dataset for human action recognition. It contains 200 videos with 3,800+ action instances. -
Epic-Kitchens-100
The Epic-Kitchens-100 dataset contains 97 verb and 300 noun classes with actions defined by the combination of nouns and verbs. -
Something-Something-V1 and V2
The Something-Something-V1 and V2 datasets contain 174 human action categories, with 108K and 220K videos respectively. -
UCF101: A Dataset of 101 Human Actions Classes from Videos in the Wild
The authors used the UCF101, HMDB51, and Diving48 datasets to evaluate the performance of their proposed algorithm. -
Pose-Aware Video Transformers
Human perception of surroundings is often guided by the various poses present within the environment. Many computer vision tasks, such as human action recognition and robot... -
Kinetics-400
Motion has been shown to be useful for video understanding, where motion is typically represented by optical flow. However, computing flow from video frames is very time-consuming.... -
AVA-Kinetics
The AVA-Kinetics dataset is a video dataset of localized human actions.