-
MoSeg: A Dataset for Object Segmentation in Videos
MoSeg is a dataset for object segmentation in videos. -
UCSD Pedestrian
A dataset used for local anomaly detection in videos with object-centric adversarial learning. -
Kinetics: A Large-Scale Video Benchmark for Human Action Recognition
A large-scale video benchmark for human action recognition. -
Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Ac...
Weakly-supervised temporal action localization aims to identify and localize action instances in untrimmed videos using only video-level action labels. -
YouTube Clickbait Detection Dataset
The dataset is a collection of online videos from YouTube, with comments and metadata. It is used to evaluate the performance of the Online Video Clickbait Protector (OVCP) scheme. -
Subway Exit
The Subway Exit dataset contains video clips of people leaving a train station. -
Subway Entrance
The Subway Entrance dataset contains video clips of people waiting to board a train. -
Vcdb: A Large-Scale Database for Partial Copy Detection in Videos
A large-scale database for partial copy detection in videos. -
Tinyvideos dataset
Tinyvideos dataset -
VisDrone2019 Dataset
The VisDrone2019 dataset contains 288 video clips, comprising 261,908 frames, and 10,209 images. -
Day-to-Day Video Dataset
A dataset of 30 videos, each 3 to 20 minutes long, covering five classes of daily activities: socializing, home repair, biking around urban areas, cooking, and home tours. -
ActivityNet1.2
The ActivityNet1.2 dataset is a large-scale benchmark for action recognition and localization in videos. -
Temporal Sentence Grounding in Videos
Temporal sentence grounding in videos (TSGV) is the task of retrieving the video segment that semantically corresponds to a natural-language query. -
InternVid: A Large-Scale Video-Text Dataset for Multimodal Understanding and ...
InternVid: A large-scale video-text dataset for multimodal understanding and generation. -
THUMOS Challenge: Action Recognition with a Large Number of Classes
THUMOS Challenge: Action Recognition with a Large Number of Classes. -
ActivityNet-1.3
Generating human action proposals in untrimmed videos is an important yet challenging task with wide applications. Current methods often suffer from the noisy boundary locations... -
SDU Fall Dataset
The dataset contains videos of people performing normal activities and falls. -
UR Dataset
The dataset contains videos of people performing normal activities and falls.