Learning to Predict Situation Hyper-Graphs for Video Question Answering
The SHG-VQA model predicts a situation hyper-graph structure composed of the actions and relations present in the input video.
Action Search: Spotting Actions in Videos
This paper proposes an action search method for spotting actions in videos.
Breakfast dataset
The Breakfast dataset, also used in the paper, contains 1,712 videos of people performing various breakfast-preparation activities, such as making coffee or scrambling eggs. ...
AViD Dataset: Anonymized Videos from Diverse Countries
AViD is a new public video dataset for action recognition, containing anonymized action videos collected from diverse countries.
THUMOS'13 Dataset
The THUMOS'13 dataset contains 3,207 videos spanning 24 action classes.
JHMDB Dataset
The JHMDB dataset contains 21 action classes spanning sports and daily activities.
UCF Sports Dataset
The UCF Sports dataset contains 10 action classes from the sports domain.
THUMOS'14, ActivityNet v1.3
THUMOS'14 and ActivityNet v1.3 are benchmarks for temporal action detection in untrimmed videos, used in works such as "Temporal action detection in untrimmed videos via multi-stage CNNs", "CDC: Convolutional-De-Convolutional networks for precise temporal action localization", and "Temporal action...
SCUBA and SCUFO
SCUBA and SCUFO are the datasets used in the paper to evaluate static bias in action representations.
ActivityNet-1.3
Generating human action proposals in untrimmed videos is an important yet challenging task with wide applications. Current methods often suffer from the noisy boundary locations...
UCF-24 and JHMDB-21
UCF-24 and JHMDB-21 are two public action datasets used to evaluate action detection algorithms.
Kinetics dataset
The Kinetics dataset is a large-scale action recognition dataset containing videos of humans performing various actions, each annotated with the action performed.
UCF-101 dataset
The UCF-101 dataset is a large-scale action recognition dataset containing 13,320 videos across 101 human action categories.
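For concreteness, the sketch below shows one way such clip-level action datasets can be loaded with torchvision's UCF101 wrapper. It is a minimal illustration, not part of any of the papers above: the paths, clip length, and batch size are placeholders, and it assumes the UCF-101 videos, the official train/test split files, and a video decoding backend (e.g., PyAV) are available locally.

```python
# Minimal sketch: loading UCF-101 clips with torchvision for action recognition.
# Paths, clip length, and batch size are placeholders; the videos and the official
# train/test split files ("ucfTrainTestlist") are assumed to be downloaded already.
import torch
from torch.utils.data import DataLoader
from torchvision.datasets import UCF101


def make_ucf101_loader(root="UCF-101", annotation_path="ucfTrainTestlist",
                       frames_per_clip=16, batch_size=8):
    # Each sample is a (video, audio, label) tuple; video is a uint8 tensor [T, H, W, C].
    dataset = UCF101(
        root=root,
        annotation_path=annotation_path,
        frames_per_clip=frames_per_clip,
        step_between_clips=frames_per_clip,  # non-overlapping clips
        train=True,
    )

    def collate(batch):
        # Drop the audio track; in practice a resize/normalize transform would also be applied.
        videos = torch.stack([video for video, _, _ in batch])
        labels = torch.tensor([label for _, _, label in batch])
        return videos, labels

    return DataLoader(dataset, batch_size=batch_size, shuffle=True, collate_fn=collate)
```

A training loop would iterate over the returned loader and permute the [B, T, H, W, C] clip tensors into whatever layout the chosen video model expects.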
Kinetics-600
The Kinetics-600 dataset consists of 392k training videos and 30k validation videos in 600 human action categories.
Kinetics Human Action Video Dataset
The Kinetics dataset is a large-scale video dataset for human action recognition.
Kinetics-400
Motion has been shown to be useful for video understanding, where it is typically represented by optical flow. However, computing flow from video frames is very time-consuming. ...
Motion-driven Visual Tempo Learning for Video-based Action Recognition
The paper proposes a Temporal Correlation Module (TCM) to deal with the variation of action visual tempo in videos; it includes a Multi-scale Temporal Dynamics Module (MTDM) and a...