-
MoSeg: A Dataset for Object Segmentation in Videos
MoSeg is a dataset for object segmentation in videos. -
UCSD Pedestrian
A dataset used for local anomaly detection in videos using object-centric adversarial learning. -
Kinetics: A Large-Scale Video Benchmark for Human Action Recognition
A large-scale video benchmark for human action recognition. -
Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Ac...
Weakly-supervised temporal action localization aims to identify and localize action instances in untrimmed videos using only video-level action labels. -
YouTube Clickbait Detection Dataset
The dataset is a collection of online videos from YouTube, with comments and metadata. It is used to evaluate the performance of the Online Video Clickbait Protector (OVCP) scheme. -
Subway Exit
The Subway Exit dataset contains video clips of people leaving a train station. -
Subway Entrance
The Subway Entrance dataset contains video clips of people waiting to board a train. -
OTB2013, OTB2015, VOT2015, VOT2016
Visual object tracking, which tracks a specified target in a changing video sequence automatically, is a fundamental problem in many topics such as visual analysis, automatic... -
WorldExpo'10
A crowd counting dataset with over 1000 labeled videos captured by over 100 monitoring cameras. -
Crowd Video Captioning Dataset
A crowd video captioning dataset based on the WorldExpo'10 dataset, with 98 videos selected and captions generated for them. -
VCDB: A Large-Scale Database for Partial Copy Detection in Videos
A large-scale database for partial copy detection in videos. -
Breast Ultrasound Video Diagnosis
The dataset is used for breast cancer diagnosis in ultrasound videos. -
Tinyvideos dataset
Tinyvideos dataset -
VisDrone2019 Dataset
The VisDrone2019 dataset contains 288 video clips comprising 261,908 frames, along with 10,209 static images. -
Day-to-Day Video Dataset
A dataset of 30 videos, each 3 to 20 minutes long, covering five classes of daily activities: socializing, home repair, biking around urban areas, cooking, and home tours. -
ActivityNet1.2
The ActivityNet1.2 dataset is a large-scale benchmark for action recognition and localization in videos.