MSR-VTT: A large video description dataset for bridging video and language

doi:doi:10.57702/q9rrdq2g

You're currently viewing an old version of this dataset. To see the current version, click here.

MSR-VTT: A large video description dataset for bridging video and language

MSR-VTT: A large video description dataset for bridging video and language.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Jun Xu, Tao Mei, Ting Yao, Yong Rui (2024). Dataset: MSR-VTT: A large video description dataset for bridging video and language. https://doi.org/10.57702/q9rrdq2g

DOI retrieved: December 2, 2024

Additional Info

Field	Value
Created	December 2, 2024
Last update	December 2, 2024
Defined In	https://doi.org/10.48550/arXiv.2311.18834
Author	Jun Xu
More Authors	Tao Mei Ting Yao Yong Rui
Homepage	https://www.microsoft.com/en-us/research/publication/msr-vtt-a-large-video-description-dataset-for-bridging-video-and-language/