-
CD-CNN dataset
The CD-CNN dataset contains data for urban resident recognition. -
NIST SD302
NIST Special Database 302 contains plain, rolled, and touch-free fingerprint impressions captured from various devices. -
Kinetics400
Video classification is a fundamental problem in many video-based tasks. Applications such as autonomous driving and the control of drones and robots are driving the demand... -
SoBiR dataset
The SoBiR dataset is used for soft biometric retrieval. It contains 8 camera views, 100 persons, and categorical annotations. -
HMDB-51 and UCF-101
HMDB-51 and UCF-101 are datasets of real videos for action categorization. -
CSTR VCTK Corpus
The CSTR VCTK Corpus is a dataset of speech recordings of 109 speakers, each with 20 utterances. -
Bengali Handwritten Digit Dataset
A dataset of 70,000 handwritten samples of Bengali numerals, used for recognition with an artificial neural network architecture pre-trained by a stacked denoising autoencoder. -
Kinetics-400
Motion has been shown to be useful for video understanding, where it is typically represented by optical flow. However, computing flow from video frames is very time-consuming.... -
Librispeech
The Librispeech dataset is a large-scale speaker-dependent speech corpus containing 1080 hours of speech, 5600 utterances, and 1000 speakers.