Dataset - LDM

Fine-tuned CLIP Models are Efficient Video Learners

This work explores the capability of a simple baseline called ViFi-CLIP (Video Fine-tuned CLIP) for adapting image-based CLIP to video domain.
- Dataset
- JSON
Mini-Kinetics

The Mini-Kinetics dataset is a mini version of the Kinetics-400 dataset, containing 240k training samples and 20k validation samples in 400 human action classes.
- Dataset
- JSON
HMDB51 and UCF101

The dataset used in the paper is HMDB51 and UCF101.
- Dataset
- JSON
Kinetics-400 and Something-Something-V2

The dataset used in the paper is Kinetics-400 and Something-Something-V2.
- Dataset
- JSON
Kinetics-400 and Kinetics-600

The Kinetics-400 and Kinetics-600 datasets are video understanding datasets used for learning rich and multi-scale spatiotemporal semantics from high-dimensional videos.
- Dataset
- JSON
Kinetics-600

The Kinetics-600 dataset consists of 392k training videos and 30k validation videos in 600 human action categories.
- Dataset
- JSON
HMDB-51

Motion has shown to be useful for video understanding, where motion is typically represented by optical flow. However, computing flow from video frames is very time-consuming....
- Dataset
- JSON
Kinetics-400

Motion has shown to be useful for video understanding, where motion is typically represented by optical flow. However, computing flow from video frames is very time-consuming....
- Dataset
- JSON
JHMDB51

A video dataset of 51 human actions classes.
- Dataset
- JSON
AVA-Kinetics

The AVA-Kinetics dataset is a video dataset of localized human actions.
- Dataset
- JSON
UCF101

The UCF101 dataset contains 13320 videos distributed in 101 action categories. This dataset is different from the above ones in that it contains mostly coarse sports activities...
- Dataset
- JSON
HMDB51

Video classification is a fundamental problem in many video-based tasks. Applications such as autonomous driving technology, controlling drones and robots are driving the demand...
- Dataset
- JSON
Kinetics

The Kinetics dataset is a large-scale human action dataset, which consists of 400 action classes where each category has more than 400 videos.
- Dataset
- JSON
ActivityNet

Temporal activity detection has drawn increasing interests in both academic and industry communities due to its vast potential applications in security surveillance, behavior...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

14 datasets found