Dataset - LDM

UASPEECH

UASPEECH is a dataset of dysarthric speech recordings.
- Dataset
- JSON

The LJ Speech Dataset

The LJ Speech dataset consists of speech recordings of a single female speaker.
- Dataset
- JSON

VCTK

Voice conversion (VC) is a technique that alters the voice of a source speaker to a target style, such as speaker identity, prosody, and emotion, while keeping the linguistic...
- Dataset
- JSON

LibriTTS

A popular text-based VC approach uses an automatic speech recognition (ASR) model to extract a phonetic posteriorgram (PPG) as the content representation.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

4 datasets found