Weakly Supervised Gaussian Networks for Action Detection

Detecting temporal extents of human actions in videos is a challenging computer vision problem that requires detailed manual supervision including frame-level labels.

BibTex: