-
Corpus of Spontaneous Japanese
The Corpus of Spontaneous Japanese: Its design and evaluation [30] is a dataset of spontaneous Japanese speech. -
PERCEPT-R audio Corpus
The PERCEPT-R audio Corpus is a collection of audio files of children and adults speaking American English. -
LaMIT corpus
The LaMIT corpus is a speech corpus for Italian, created and labeled specifically for this work. -
LaMIT database
The LaMIT database is a speech corpus for Italian, created and labeled specifically for this work. -
WSJ0-mix dataset
The WSJ0-mix dataset contains a min version of 2-, 3-, 4-, and 5-speaker mixtures simulated using clean speech in the WSJ0 corpus. -
TED-LIUM 3
TED-LIUM 3 (TL3) is a TED talks dataset. Speaker adaptation data for TL3 was divided randomly, where 2/5 was divided into the train set, 1/5 was divided into the dev set, and... -
Speech Corpus
A speech corpus of size 7,000 used for training and validation of the FCI module. -
TIMIT Corpus
The TIMIT corpus is a large database of speech recordings used for speaker recognition and speech synthesis tasks. -
WSJ corpus
The WSJ corpus contains 81.48 hours of speech from 283 adults. -
speechocean762
speechocean762: An open-source non-native English speech corpus for pronunciation assessment. -
Buckeye Speech Corpus
The English dataset consists of approximately 300,000 words spoken by 40 speakers from Central Ohio in conversational settings with an interviewer. -
HKUST/MTS: A Very Large Scale Mandarin Telephone Speech Corpus
The HKUST dataset is a large dataset of speech recordings, each containing a single speaker speaking a sentence. -
The Wall Street Journal Corpus
The WSJ dataset is a large dataset of speech recordings, each containing a single speaker speaking a sentence. -
TIMIT Acoustic-Phonetic Continuous Speech Corpus
The TIMIT acoustic-phonetic continuous speech corpusCD-ROM contains a large collection of speech samples from 250 male and 250 female speakers. -
Voice Bank speech corpus
The Voice Bank speech corpus is a selection of ten British English speakers – both male and female – from the Voice Bank speech corpus, each of which has around 400 clean... -
Chinese Standard Mandarin Speech Corpus (CSMSC)
The Chinese Standard Mandarin Speech Corpus (CSMSC) is a large-scale speech corpus containing 10,000 recorded sentences read by a female speaker. -
Librispeech
The Librispeech dataset is a large-scale speaker-dependent speech corpus containing 1080 hours of speech, 5600 utterances, and 1000 speakers.