ActivityNet 2019 Task 3: Exploring Contexts for Dense Captioning Events in Videos

doi:10.57702/qmxe5ovb

Contextual reasoning is essential for understanding events in long untrimmed videos. In this work, we systematically explore different captioning models with various contexts for the dense-captioning events in videos task, which aims to generate a caption for each event in an untrimmed video.

BibTeX:

@dataset{Shizhe_Chen_and_Yuqing_Song_and_Yida_Zhao_and_Qin_Jin_and_Zhaoyang_Zeng_and_Bei_Liu_and_Jianlong_Fu_and_Alexander_Hauptmann_2024,
    abstract = {Contextual reasoning is essential for understanding events in long untrimmed videos. In this work, we systematically explore different captioning models with various contexts for the dense-captioning events in videos task, which aims to generate a caption for each event in an untrimmed video.},
    author = {Shizhe Chen and Yuqing Song and Yida Zhao and Qin Jin and Zhaoyang Zeng and Bei Liu and Jianlong Fu and Alexander Hauptmann},
    doi = {10.57702/qmxe5ovb},
    institution = {No Organization},
    keyword = {contextual reasoning, dense video captioning, event captioning},
    month = {dec},
    publisher = {TIB},
    title = {ActivityNet 2019 Task 3: Exploring Contexts for Dense Captioning Events in Videos},
    url = {https://service.tib.eu/ldmservice/dataset/activitynet-2019-task-3--exploring-contexts-for-dense-captioning-events-in-videos},
    year = {2024}
}