Dataset - LDM

ActivityNet-1.3

Generating human action proposals in untrimmed videos is an important yet challenging task with wide applications. Current methods often suffer from the noisy boundary locations...
- Dataset
- JSON
ActivityNet, MSR-VTT, and MSVD

The dataset used in the paper is ActivityNet, MSR-VTT, and MSVD. The authors used these datasets for text-to-video retrieval tasks.
- Dataset
- JSON
ActivityNet v1.2

Weakly-Supervised Temporal Action Localization (WSTAL) aims to localize actions in untrimmed videos with only video-level labels.
- Dataset
- JSON
ActivityNet Captions

The ActivityNet Captions is a benchmark dataset proposed for dense video captioning. There are 20K untrimmed videos in total, and each video has several annotated segments with...
- Dataset
- JSON
ActivityNet

Temporal activity detection has drawn increasing interests in both academic and industry communities due to its vast potential applications in security surveillance, behavior...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

5 datasets found