4 datasets found

Tags: speech corpus

  • TIMIT Corpus

    The TIMIT corpus is a large database of speech recordings used for speaker recognition and speech synthesis tasks.
  • AISHELL-3

    The Mandarin dataset comprises over 88,000 read utterances and roughly 85 hours of speech data.
  • Buckeye Speech Corpus

    The English dataset consists of approximately 300,000 words spoken by 40 speakers from Central Ohio in conversational settings with an interviewer.
  • LibriTTS

    A popular text-based VC approach is to use an automatic speech recognition (ASR) model to extract phonetic posteriorgram (PPG) as content representation.
You can also access this registry using the API (see API Docs).