4 datasets found

Formats: JSON Tags: Speech Data

Filter Results
  • UASPEECH

    The UASPEECH dataset is a dataset of speech recordings.
  • The LJ Speech Dataset

    The LJ-Speech dataset is a dataset of speech recordings of a female speaker.
  • VCTK

    Voice conversion (VC) is a technique that alters the voice of a source speaker to a target style, such as speaker identity, prosody, and emotion, while keeping the linguistic...
  • LibriTTS

    A popular text-based VC approach is to use an automatic speech recognition (ASR) model to extract phonetic posteriorgram (PPG) as content representation.
You can also access this registry using the API (see API Docs).