Language-free Training for Zero-shot Video Grounding

doi:doi:10.57702/i73xj8ql

Language-free Training for Zero-shot Video Grounding

Given an untrimmed video and a language query, video grounding aims to localize the time interval by understanding the text and video simultaneously.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Dahye Kim, Jungin Park, Jiyoung Lee, Seongheon Park, Kwanghoon Sohn (2024). Dataset: Language-free Training for Zero-shot Video Grounding. https://doi.org/10.57702/i73xj8ql

DOI retrieved: December 2, 2024

Additional Info

Field	Value
Created	December 2, 2024
Last update	December 2, 2024
Defined In	https://doi.org/10.48550/arXiv.2210.12977
Author	Dahye Kim
More Authors	Jungin Park Jiyoung Lee Seongheon Park Kwanghoon Sohn