SEDS: Semantically Enhanced Dual-Stream Encoder for Sign Language Retrieval
Sign language retrieval is more biased towards understanding the semantic information of human actions contained in video clips. The proposed framework addresses these issues by integrating Pose and RGB modalities to represent the local and global information of sign language videos.
BibTex: