Video Understanding - Groups

VideoStreaming

A novel approach to tackle the complexities of long video understanding with large language models (LLMs). Our proposed memory-propagated streaming encoding architecture...

Kinetics-400, UCF101, HMDB51, Something-Something V1, and Something-Something V2

The Kinetics-400, UCF101, HMDB51, Something-Something V1, and Something-Something V2 datasets are used to evaluate the performance of Bi-Calibration Networks.

HMDB-51

Motion has been shown to be useful for video understanding, where it is typically represented by optical flow. However, computing flow from video frames is very time-consuming....

Kinetics-400

Motion has been shown to be useful for video understanding, where it is typically represented by optical flow. However, computing flow from video frames is very time-consuming....

MSVD

Text-Video Retrieval (TVR) aims to align relevant video content with natural language queries. To date, most state-of-the-art TVR methods learn image-to-video transfer learning...

MSR-VTT

The dataset used in the paper is MSR-VTT, a large video description dataset for bridging video and language. The dataset contains 10k video clips with length varying from 10 to...

TextVid

The TextVid dataset is a textual video dataset automatically generated by advanced LLMs.

TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-...

TOPA is a text-only pre-alignment framework for extending large language models for video understanding without the need for pre-training on real video data.

UCF101

The UCF101 dataset contains 13,320 videos across 101 action categories. This dataset is different from the above ones in that it contains mostly coarse sports activities...

Kinetics-400, Something-Something-V2, and Epic-Kitchens-100

The authors used the Kinetics-400, Something-Something-V2, and Epic-Kitchens-100 datasets for video understanding tasks.

50 datasets found