You're currently viewing an old version of this dataset. To see the current version, click here.

MSR-VTT: A large video description dataset for bridging video and language

MSR-VTT: A large video description dataset for bridging video and language.

Data and Resources

Cite this as

Jun Xu, Tao Mei, Ting Yao, Yong Rui (2024). Dataset: MSR-VTT: A large video description dataset for bridging video and language. https://doi.org/10.57702/q9rrdq2g

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Defined In https://doi.org/10.48550/arXiv.2311.18834
Author Jun Xu
More Authors
Tao Mei
Ting Yao
Yong Rui
Homepage https://www.microsoft.com/en-us/research/publication/msr-vtt-a-large-video-description-dataset-for-bridging-video-and-language/