- Multi-Mice PartsTrack dataset
The Multi-Mice PartsTrack dataset is a challenging dataset for multi-mice part tracking in videos. It contains 10 videos of two or three mice interacting freely in a home cage...
- My View is the Best View: Procedure Learning from Egocentric Videos
A dataset for procedure learning from egocentric videos.
- UCF Sports
The UCF Sports dataset consists of 150 videos from sports broadcasts covering 10 action categories.
- MovieQA, TVQA, AVSD, EQA, Embodied QA
A collection of visual question answering datasets: MovieQA, TVQA, AVSD, and EQA (Embodied Question Answering).
- SoccerNet-v2
SoccerNet-v2 is a large-scale dataset for action spotting in soccer videos, containing over 110K action labels.
- MPI-INF-3DHP dataset
The MPI-INF-3DHP dataset is a large-scale dataset for 3D human pose estimation in videos. It consists of 8 subjects performing 8 activities.
- Human3.6M dataset
The Human3.6M dataset is a large-scale dataset for 3D human pose estimation in videos. It consists of 3.6 million frames captured by four 50 Hz cameras.
- UCF-101 dataset for human action recognition
UCF-101 is a large-scale dataset of human actions in videos.
- Kinetics dataset
The Kinetics dataset is a large-scale action recognition dataset. It contains videos of human actions, each annotated with the action performed.
- CVBL video database
The CVBL video database is a dataset for face recognition in videos.
- Tunnel Try-on
The Tunnel Try-on dataset is a collection of videos with product garment images.