Language-free Training for Zero-shot Video Grounding

Given an untrimmed video and a language query, video grounding aims to localize the time interval by understanding the text and video simultaneously.

Data and Resources

Cite this as

Dahye Kim, Jungin Park, Jiyoung Lee, Seongheon Park, Kwanghoon Sohn (2024). Dataset: Language-free Training for Zero-shot Video Grounding. https://doi.org/10.57702/i73xj8ql

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Defined In https://doi.org/10.48550/arXiv.2210.12977
Author Dahye Kim
More Authors
Jungin Park
Jiyoung Lee
Seongheon Park
Kwanghoon Sohn