-
EPIC-KITCHENS
EPIC-KITCHENS is a large-scale egocentric video benchmark recorded by 32 participants in their native kitchen environments. Our videos depict non-scripted daily activities: we... -
Temporal Sentence Grounding in Videos
Temporal sentence grounding in videos (TSGV) is the task of retrieving the video segment that semantically corresponds to a natural-language query. -
WildDeepFakes
A challenging real-world dataset for deepfake detection. -
Temporal Deepfake Segment Benchmark
A deepfake detection method that addresses videos in which only some segments have been modified using generative techniques. -
AGRE ADOS database, Kaggle database, and self-gathered video test dataset
The AGRE ADOS database, the Kaggle database, and a self-gathered video test dataset with corresponding ADOS data. -
InternVid: A Large-Scale Video-Text Dataset for Multimodal Understanding and ...
InternVid: A large-scale video-text dataset for multimodal understanding and generation. -
THUMOS Challenge: Action Recognition with a Large Number of Classes
THUMOS Challenge: Action Recognition with a Large Number of Classes. -
ActivityNet-1.3
Generating human action proposals in untrimmed videos is an important yet challenging task with wide applications. Current methods often suffer from the noisy boundary locations... -
SDU Fall Dataset
The dataset contains videos of people performing normal activities and falls. -
UR Dataset
The dataset contains videos of people performing normal activities and falls. -
Spatio-Temporal Adversarial Learning for Detecting Unseen Falls
A spatio-temporal adversarial learning framework for detecting unseen falls in videos. -
SEWA: A Large-Scale Video Dataset for Affective Computing
The SEWA dataset contains video clips annotated with facial landmarks, valence, and arousal. -
AFEW-VA: A Database for Valence and Arousal Estimation in-the-Wild
The AFEW-VA dataset contains video clips annotated with valence and arousal. -
ActivityNet v1.3
Temporal action proposal generation is an important task: akin to object proposals, temporal action proposals are intended to capture “clips” or temporal intervals in videos...