13 datasets found

Tags: action recognition

  • AVA v2.2

    The AVA v2.2 dataset for spatiotemporal action localization contains bounding-box annotations and the corresponding action labels on keyframes.
  • Mini-Kinetics

    The Mini-Kinetics dataset is a mini version of the Kinetics-400 dataset, containing 240k training samples and 20k validation samples in 400 human action classes.
  • HMDB51 dataset

    The HMDB51 dataset is a video dataset for human action recognition. It contains 6,767 videos annotated with 51 categories of human actions.
  • Kinetics dataset

    The Kinetics dataset is a large-scale action recognition dataset. It contains videos of various actions performed by humans, with annotations of the actions performed.
  • UCF-101 dataset

    The UCF-101 dataset is a large-scale action recognition dataset containing 13,320 videos in 101 human action categories.
  • Kinetics-600

    The Kinetics-600 dataset consists of 392k training videos and 30k validation videos in 600 human action categories.
  • HMDB-51

    Motion has been shown to be useful for video understanding, where it is typically represented by optical flow. However, computing flow from video frames is very time-consuming....
  • Kinetics-400

    Motion has been shown to be useful for video understanding, where it is typically represented by optical flow. However, computing flow from video frames is very time-consuming....
  • UCF101

    The UCF101 dataset contains 13,320 videos distributed across 101 action categories. This dataset is different from the above ones in that it contains mostly coarse sports activities...
  • HMDB51

    Video classification is a fundamental problem in many video-based tasks. Applications such as autonomous driving technology, controlling drones and robots are driving the demand...
  • Kinetics

    The Kinetics dataset is a large-scale human action dataset, which consists of 400 action classes where each category has more than 400 videos.
  • UCF101 dataset

    The UCF101 dataset is used to test the proposed text-to-video model. The dataset contains 101 action categories, and each category has 10 videos. The videos are labeled with text...
  • ActivityNet

    Temporal activity detection has drawn increasing interest in both the academic and industry communities due to its vast potential applications in security surveillance, behavior...