- Million Song Dataset
The Million Song Dataset is a collection of audio features and metadata for a million contemporary pop songs. Instead of storing any audio, the dataset consists of features derived...
- AVA-Speech
The AVA-Speech dataset is a publicly available dataset of movies densely labeled with speech activity.
- AudioMNIST
The AudioMNIST dataset consists of recordings of 60 speakers, 33% female, each of whom spoke every digit (0-9) 50 times.