-
PHOENIX-2014T
PHOENIX-2014T is a German sign language (DGS) dataset that mainly includes weather forecast content from TV programs. It consists of 7096, 519, and 642 video text pairs in... -
SEDS: Semantically Enhanced Dual-Stream Encoder for Sign Language Retrieval
Sign language retrieval is more biased towards understanding the semantic information of human actions contained in video clips. The proposed framework addresses these issues by... -
ActivityNet v1.2
Weakly-Supervised Temporal Action Localization (WSTAL) aims to localize actions in untrimmed videos with only video-level labels.