Dataset - LDM

MoSeg: A Dataset for Object Segmentation in Videos

MoSeg is a dataset for object segmentation in videos.
- Dataset
- JSON
UCSD Pedestrian

The dataset used for local anomaly detection in videos using object-centric adversarial learning.
- Dataset
- JSON
Kinetics: A Large-Scale Video Benchmark for Human Action Recognition

A large-scale video benchmark for human action recognition.
- Dataset
- JSON
Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Ac...

Weakly-supervised temporal action localization aims to identify and localize the action instances in the untrimmed videos with only video-level action labels.
- Dataset
- JSON
YouTube Clickbait Detection Dataset

The dataset is a collection of online videos from YouTube, with comments and metadata. It is used to evaluate the performance of the Online Video Clickbait Protector (OVCP) scheme.
- Dataset
- JSON
Subway Exit

The Subway Exit dataset contains video clips of people leaving a train station.
- Dataset
- JSON
Subway Entrance

The Subway Entrance dataset contains video clips of people waiting to board a train.
- Dataset
- JSON
CityCam

CityCam dataset contains recordings from 212 trafﬁc cameras in the United States, with a resolution of 352x240 and a frame rate of 1 frame/per second.
- Dataset
- JSON
OTB2013, OTB2015, VOT2015, VOT2016

Visual object tracking, which tracks a specified target in a changing video sequence automatically, is a fundamental problem in many topics such as visual analysis, automatic...
- Dataset
- JSON
Crowd-11

A fine-grained crowd behavior analysis dataset with 16 videos and annotated crowd behaviors.
- Dataset
- JSON
MED

A crowd emotion dataset with 31 videos and annotated crowd emotions.
- Dataset
- JSON
WorldExpo'10

A crowd counting dataset with over 1000 labeled videos captured by over 100 monitoring cameras.
- Dataset
- JSON
Crowd Video Captioning Dataset

A crowd video captioning dataset based on the WorldExpo'10 dataset, with 98 videos selected and captions generated for them.
- Dataset
- JSON
Vcdb: A Large-Scale Database for Partial Copy Detection in Videos

A large-scale database for partial copy detection in videos.
- Dataset
- JSON
Breast Ultrasound Video Diagnosis

The dataset is used for breast cancer diagnosis in ultrasound videos.
- Dataset
- JSON
Tinyvideos dataset

Tinyvideos dataset
- Dataset
- JSON
STV-IDL

The STV-IDL dataset is a collection of videos with referring expressions that describe spatio-temporal events.
- Dataset
- JSON
VisDrone2019 Dataset

The VisDrone2019 dataset contains 288 video clips made up of 261,908 frames and 10,209 images
- Dataset
- JSON
Day-to-Day Video Dataset

A dataset of 30 videos of length 3 minutes to 20 minutes from five classes of daily activities: socializing, home repair, biking around urban areas, cooking, and home tours.
- Dataset
- JSON
ActivityNet1.2

The ActivityNet1.2 dataset is a large-scale benchmark for action recognition and localization in videos.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

86 datasets found