-
DiscrimNet: Semi-Supervised Action Recognition from Videos using Generative A...
Propose an action recognition framework using Generative Adversarial Networks. Our model involves training a deep convolutional generative adversarial network (DCGAN) using a... -
UCF-101 dataset
UCF-101 dataset is a large-scale action recognition dataset, containing 13,320 videos categorized into 101 human action categories. -
DreaMo: Articulated 3D Reconstruction From A Single Casual Video
A dataset of 42 animal video clips with diverse species and insufficient view coverage from the Internet. -
MultiTHUMOS
Temporal action localization (TAL) is a prevailing task due to its great application potential. Existing works in this field mainly suffer from two weaknesses: (1) They often... -
TemporalMaxer: Maximize Temporal Context with only Max Pooling
Temporal action localization (TAL) is a challenging task in video understanding that aims to identify and localize actions within a video sequence. -
Kinetics-600
The Kinetics-600 dataset consists of 392k training videos and 30k validation videos in 600 human action categories. -
Kinetics Human Action Video Dataset
The Kinetics dataset is a large-scale video dataset for human action recognition. -
ActivityNet Challenge 2016
The dataset used in the paper is the ActivityNet Challenge 2016 dataset. -
Kinetics-400
Motion has shown to be useful for video understanding, where motion is typically represented by optical flow. However, computing flow from video frames is very time-consuming.... -
Motion-driven Visual Tempo Learning for Video-based Action Recognition
The proposed Temporal Correlation Module (TCM) to deal with the variation of action visual tempo in videos, which includes a Multi-scale Temporal Dynamics Module (MTDM) and a... -
VIRAT dataset
A video dataset for frame duplication detection and localization in forged videos -
ActivityNet Captions
The ActivityNet Captions is a benchmark dataset proposed for dense video captioning. There are 20K untrimmed videos in total, and each video has several annotated segments with...