Multilevel language and vision integration for text-to-clip retrieval
Data and Resources
-
Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Cite this as
Huijuan Xu, Kun He, Bryan A Plummer, Leonid Sigal, Stan Sclaroff, Kate Saenko (2024). Dataset: Multilevel language and vision integration for text-to-clip retrieval. https://doi.org/10.57702/eyaycdb9
DOI retrieved: December 2, 2024
Additional Info
Field | Value |
---|---|
Created | December 2, 2024 |
Last update | December 2, 2024 |
Defined In | https://doi.org/10.48550/arXiv.2108.10576 |
Author | Huijuan Xu |
More Authors |
|
Homepage | https://arxiv.org/abs/1909.01696 |