Dataset - LDM

PRODIS

PRODIS is a speech database and a phoneme-based language model for the study of predictability effects in Polish.
- Dataset
- JSON
TSP speech database

The TSP speech database is a dataset of speech recordings.
- Dataset
- JSON
TIMIT

The TIMIT corpus is a widely used benchmark for speech recognition tasks. It contains 3,696 training utterances from 462 speakers, excluding the SA sentences. The core test set...
- Dataset
- JSON
LibriTTS

A popular text-based VC approach is to use an automatic speech recognition (ASR) model to extract phonetic posteriorgram (PPG) as content representation.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

4 datasets found