Dataset - LDM

PortraitMode-400

PortraitMode-400 is a dataset dedicated to portrait mode video recognition, with a fine-grained taxonomy of 400 categories.
- Dataset
- JSON
Penn Action

The Penn Action dataset is a real video dataset of people performing various indoor and outdoor sports with annotations of human joint locations.
- Dataset
- JSON
MMX-Trailer-20 Dataset

Long form video understanding (LVU) is a sub-domain of video recognition concerned with understanding contextual information across contiguous shots which can contain multiple...
- Dataset
- JSON
LocalStyleFool

LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything
- Dataset
- JSON
Diving48

The Diving48 dataset is a fine-grained video dataset of competitive diving. It has ∼18k trimmed video clips of 48 unambiguous dive sequences standardized by the professional....
- Dataset
- JSON
Mini-Kinetics

The Mini-Kinetics dataset is a mini version of the Kinetics-400 dataset, containing 240k training samples and 20k validation samples in 400 human action classes.
- Dataset
- JSON
HMDB51 dataset

The HMDB51 dataset is a video dataset for human action recognition. It contains 6,767 videos annotated with 51 categories of human actions.
- Dataset
- JSON
Kinetics-700 dataset

The Kinetics-700 dataset is a large-scale video dataset for human action recognition. It contains 555,774 videos annotated with 700 categories of human actions.
- Dataset
- JSON
KTH

The KTH dataset consists of videos of 25 people performing different activities.
- Dataset
- JSON
HowTo100M

The dataset used in the LORD framework for autonomous driving, consisting of images, videos, and text-based observations.
- Dataset
- JSON
Moments in Time

The Moments in Time dataset is a large-scale video action recognition dataset.
- Dataset
- JSON
MoViNets: Mobile Video Networks for Efficient Video Recognition

Mobile Video Networks (MoViNets) are a family of computation- and memory-efficient video networks that can operate on streaming video for online inference.
- Dataset
- JSON
Jester

The Jester dataset contains continuous joke ratings from -10 to 10, along with the jokes’ texts.
- Dataset
- JSON
Something-Something V1

Video classification is a fundamental problem in many video-based tasks. Applications such as autonomous driving technology, controlling drones and robots are driving the demand...
- Dataset
- JSON
Kinetics-600

The Kinetics-600 dataset consists of 392k training videos and 30k validation videos in 600 human action categories.
- Dataset
- JSON
HMDB-51

Motion has been shown to be useful for video understanding, where motion is typically represented by optical flow. However, computing flow from video frames is very time-consuming....
- Dataset
- JSON
Kinetics-400

Motion has been shown to be useful for video understanding, where motion is typically represented by optical flow. However, computing flow from video frames is very time-consuming....
- Dataset
- JSON
Something-Something V1 & V2

The Something-Something V1 & V2 dataset is a large-scale video dataset created by crowdsourcing. It contains about 100k videos over 174 categories, and the number of videos...
- Dataset
- JSON
HMDB51

Video classification is a fundamental problem in many video-based tasks. Applications such as autonomous driving technology, controlling drones and robots are driving the demand...
- Dataset
- JSON
Charades

A dataset used for video action classification, consisting of 9.8k training videos, 1.8k validation videos, and 157 classes.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

25 datasets found