Dataset - LDM

Kinetics400

Video classification is a fundamental problem in many video-based tasks. Applications such as autonomous driving, drone control, and robotics are driving the demand...
- Dataset
- JSON
Video-MNIST

Video-MNIST is a novel variant of the classic MNIST dataset. It contains 70,000 sequences, each with 30 frames showing an affine transformation on a single...
- Dataset
- JSON
Something-Something V1

Video classification is a fundamental problem in many video-based tasks. Applications such as autonomous driving, drone control, and robotics are driving the demand...
- Dataset
- JSON
FineGym

FineGym is a hierarchical video dataset for fine-grained action understanding, containing 354 action categories.
- Dataset
- JSON
Resound

Resound is a video dataset for action recognition without representation bias.
- Dataset
- JSON
Structural Vision Transformer

Structural Vision Transformer (StructViT) is a vision transformer network that leverages structural self-attention (StructSA) to capture correlation structures in images and...
- Dataset
- JSON
Kinetics-400

Motion has been shown to be useful for video understanding, where it is typically represented by optical flow. However, computing flow from video frames is very time-consuming....
- Dataset
- JSON
Something-Something V1 & V2

The Something-Something V1 & V2 dataset is a large-scale video dataset created by crowdsourcing. It contains about 100k videos over 174 categories, and the number of videos...
- Dataset
- JSON
UCF101

The UCF101 dataset contains 13,320 videos across 101 action categories. It differs from the datasets above in that it contains mostly coarse sports activities...
- Dataset
- JSON
HMDB51

Video classification is a fundamental problem in many video-based tasks. Applications such as autonomous driving, drone control, and robotics are driving the demand...
- Dataset
- JSON
Kinetics-700

Kinetics-700 is a large-scale video dataset for human action recognition, with 700 action categories.
- Dataset
- JSON
YouTube-8M

YouTube-8M is a large-scale video classification benchmark.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

12 datasets found