Zero-Shot Temporal Action Detection via Vision-Language Prompting

Zero-Shot Temporal Action Detection via Vision-Language Prompting (STALE) model for the under-studied yet practically useful zero-shot temporal action detection (ZS-TAD)

Data and Resources

Cite this as

Sauradip Nag, Xiatian Zhu, Yi-Zhe Song, Tao Xiang (2024). Dataset: Zero-Shot Temporal Action Detection via Vision-Language Prompting. https://doi.org/10.57702/64yrzs7p

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2207.08184
Author Sauradip Nag
More Authors
Xiatian Zhu
Yi-Zhe Song
Tao Xiang
Homepage https://github.com/sauradip/STALE