Cite this as

Hyounghun Kim, Zineng Tang, Mohit Bansal (2024). Dataset: Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA. Resource: Original Metadata. https://doi.org/10.57702/vzyk3wds

DOI retrieved: December 16, 2024

Additional Information

Field         Value
Created       December 16, 2024
Last updated  December 16, 2024
Format        JSON