Video Analysis - Groups

MoSeg: A Dataset for Object Segmentation in Videos

MoSeg is a dataset for object segmentation in videos.
- Dataset
- JSON
UCSD Pedestrian

The dataset used for local anomaly detection in videos using object-centric adversarial learning.
- Dataset
- JSON
Kinetics: A Large-Scale Video Benchmark for Human Action Recognition

A large-scale video benchmark for human action recognition.
- Dataset
- JSON
Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Ac...

Weakly-supervised temporal action localization aims to identify and localize the action instances in the untrimmed videos with only video-level action labels.
- Dataset
- JSON
YouTube Clickbait Detection Dataset

The dataset is a collection of online videos from YouTube, with comments and metadata. It is used to evaluate the performance of the Online Video Clickbait Protector (OVCP) scheme.
- Dataset
- JSON
Subway Exit

The Subway Exit dataset contains video clips of people leaving a train station.
- Dataset
- JSON
Subway Entrance

The Subway Entrance dataset contains video clips of people waiting to board a train.
- Dataset
- JSON
CityCam

CityCam dataset contains recordings from 212 trafﬁc cameras in the United States, with a resolution of 352x240 and a frame rate of 1 frame/per second.
- Dataset
- JSON
Vcdb: A Large-Scale Database for Partial Copy Detection in Videos

A large-scale database for partial copy detection in videos.
- Dataset
- JSON
Tinyvideos dataset

Tinyvideos dataset
- Dataset
- JSON
STV-IDL

The STV-IDL dataset is a collection of videos with referring expressions that describe spatio-temporal events.
- Dataset
- JSON
VisDrone2019 Dataset

The VisDrone2019 dataset contains 288 video clips made up of 261,908 frames and 10,209 images
- Dataset
- JSON
Day-to-Day Video Dataset

A dataset of 30 videos of length 3 minutes to 20 minutes from five classes of daily activities: socializing, home repair, biking around urban areas, cooking, and home tours.
- Dataset
- JSON
ActivityNet1.2

The ActivityNet1.2 dataset is a large-scale benchmark for action recognition and localization in videos.
- Dataset
- JSON
Temporal Sentence Grounding in Videos

Temporal sentence grounding in videos (TSGV) is a task to retrieve a video segment that semantically corresponds to a query in natural language.
- Dataset
- JSON
InternVid: A Large-Scale Video-Text Dataset for Multimodal Understanding and ...

InternVid: A large-scale video-text dataset for multimodal understanding and generation.
- Dataset
- JSON
THUMOS Challenge: Action Recognition with a Large Number of Classes

THUMOS Challenge: Action Recognition with a Large Number of Classes.
- Dataset
- JSON
ActivityNet-1.3

Generating human action proposals in untrimmed videos is an important yet challenging task with wide applications. Current methods often suffer from the noisy boundary locations...
- Dataset
- JSON
SDU Fall Dataset

The dataset contains videos of people performing normal activities and falls.
- Dataset
- JSON
UR Dataset

The dataset contains videos of people performing normal activities and falls.
- Dataset
- JSON

56 datasets found