Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Ac...
Weakly-supervised temporal action localization aims to identify and localize action instances in untrimmed videos using only video-level action labels.
HACS and Epic-Kitchens-100
The authors used the HACS and Epic-Kitchens-100 datasets for action localization tasks.