DiDeMo

The DiDeMo dataset is a large-scale video-text dataset, containing 10,000 videos and 40,000 annotations.

Data and Resources

Cite this as

H. Luo, L. Ji, M. Zhong, Y. Chen, W. Lei, D. Duan, T. Li, J. Bharti, M. Zhou (2024). Dataset: DiDeMo. https://doi.org/10.57702/d8wjkgdy

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Defined In https://doi.org/10.1109/TPAMI.2023.3258628
Citation
  • https://doi.org/10.48550/arXiv.2307.09972
  • https://doi.org/10.48550/arXiv.2404.13425
  • https://doi.org/10.1609/aaai.v37i3.25483
Author H. Luo
More Authors
L. Ji
M. Zhong
Y. Chen
W. Lei
D. Duan
T. Li
J. Bharti
M. Zhou
Homepage https://demos.csail.mit.edu/