Dataset - LDM

VVT

The VVT dataset is a collection of product garment images and corresponding videos.
- Dataset
- JSON
J-HMDB

A video dataset for action recognition, consisting of 920 videos of 21 different actions.
- Dataset
- JSON
Kinetics-400

Motion has been shown to be useful for video understanding, where motion is typically represented by optical flow. However, computing flow from video frames is very time-consuming....
- Dataset
- JSON
Avenue

The Avenue dataset consists of 16 training videos with a total of 15328 frames and 21 test videos with a total of 15324 frames.
- Dataset
- JSON
UCF101

The UCF101 dataset contains 13320 videos distributed across 101 action categories. This dataset is different from the above ones in that it contains mostly coarse sports activities...
- Dataset
- JSON
HMDB51

Video classification is a fundamental problem in many video-based tasks. Applications such as autonomous driving technology, controlling drones and robots are driving the demand...
- Dataset
- JSON
Kinetics

The Kinetics dataset is a large-scale human action dataset, which consists of 400 action classes where each category has more than 400 videos.
- Dataset
- JSON
UCF101 dataset

The UCF101 dataset is used to test the proposed text-to-video model. The dataset contains 101 action categories, and each category has 10 videos. The videos are labeled with text...
- Dataset
- JSON
KITTI dataset

The dataset used in the paper is the KITTI dataset, which is a benchmark for monocular depth estimation. The dataset consists of a large collection of images and corresponding...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

29 datasets found