Streamlined dense video captioning
ActivityNet 2019 Task 3: Exploring Contexts for Dense Captioning Events in Vi...
Contextual reasoning is essential to understand events in long untrimmed videos. In this work, we systematically explore different captioning models with various contexts for...
ActivityNet Captions
ActivityNet Captions is a benchmark dataset proposed for dense video captioning. It contains 20K untrimmed videos in total, and each video has several annotated segments with...