Multilevel language and vision integration for text-to-clip retrieval

Cite this as

Huijuan Xu, Kun He, Bryan A Plummer, Leonid Sigal, Stan Sclaroff, Kate Saenko (2024). Dataset: Multilevel language and vision integration for text-to-clip retrieval. https://doi.org/10.57702/eyaycdb9

DOI retrieved: December 2, 2024

Additional Info

Field: Value
Created: December 2, 2024
Last update: December 2, 2024
Defined In: https://doi.org/10.48550/arXiv.2108.10576
Author: Huijuan Xu
More Authors: Kun He, Bryan A Plummer, Leonid Sigal, Stan Sclaroff, Kate Saenko
Homepage: https://arxiv.org/abs/1909.01696