Dataset - LDM

Million Song Dataset

The Million Song Dataset is a collection of audio features and metadata for a million contemporary pop songs. Instead of storing any audio, the dataset consists of features derived...
- Dataset
- JSON

AVA-Speech

The AVA-Speech dataset is a publicly available dataset of movies densely labeled with speech activity.
- Dataset
- JSON

VoxCeleb

Speaker verification systems experience significant performance degradation when tasked with short-duration trial recordings. To address this challenge, a multi-scale feature...
- Dataset
- JSON

AudioMNIST

The AudioMNIST dataset consists of recordings of 60 speakers (33% female), each of whom spoke every digit (0-9) 50 times.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

4 datasets found