-
Contrastive Multiple Instance Learning for Weakly Supervised Person ReID
The acquisition of large-scale, precisely labeled datasets for person re-identification (ReID) poses a significant challenge. Weakly supervised ReID has begun to address this... -
Weakly Supervised Gaussian Networks for Action Detection
Detecting temporal extents of human actions in videos is a challenging computer vision problem that requires detailed manual supervision including frame-level labels. -
XD-Violence
The XD-Violence dataset is a large-scale multimodal video dataset for violence detection. It consists of 4,754 untrimmed videos with a total duration of 217 hours, covering six... -
Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence ...
Weakly supervised multimodal violence detection aims to learn a violence detection model by leveraging multiple modalities such as RGB, optical flow, and audio, while only... -
DeepCap: Monocular Human Performance Capture Using Weak Supervision
A dataset for human performance capture using weak supervision.