Speech Corpus

Corpus of Spontaneous Japanese

The Corpus of Spontaneous Japanese, described in "The Corpus of Spontaneous Japanese: Its design and evaluation" [30], is a dataset of spontaneous Japanese speech.

PERCEPT-R audio Corpus

The PERCEPT-R audio Corpus is a collection of audio files of children and adults speaking American English.

AISHELL-1

The AISHELL-1 dataset is a Mandarin speech corpus, consisting of 178 hours of speech, with 11 domains and 400 speakers from different accent areas in China.

LaMIT corpus

The LaMIT corpus is a speech corpus for Italian, created and labeled specifically for this work.

LaMIT database

The LaMIT database is a speech corpus for Italian, created and labeled specifically for this work.

WSJ0-mix dataset

The WSJ0-mix dataset, in its "min" version, contains 2-, 3-, 4-, and 5-speaker mixtures simulated from clean speech in the WSJ0 corpus.

TED-LIUM 3

TED-LIUM 3 (TL3) is a dataset of TED talks. The speaker adaptation data for TL3 was split randomly: 2/5 was assigned to the train set, 1/5 to the dev set, and...

A speech corpus of size 7,000 used for training and validation of the FCI module.

TIMIT Corpus

The TIMIT corpus is a large database of speech recordings used for speaker recognition and speech synthesis tasks.

WSJ corpus

The WSJ corpus contains 81.48 hours of speech from 283 adults.

TIMIT

The TIMIT corpus is a widely used benchmark for speech recognition tasks. It contains 3,696 training utterances from 462 speakers, excluding the SA sentences. The core test set...

speechocean762

speechocean762 is an open-source non-native English speech corpus for pronunciation assessment.

AISHELL-3

AISHELL-3 is a Mandarin dataset comprising over 88,000 read utterances and roughly 85 hours of speech data.

Buckeye Speech Corpus

The Buckeye Speech Corpus is an English dataset of approximately 300,000 words spoken by 40 speakers from Central Ohio in conversational settings with an interviewer.

HKUST/MTS: A Very Large Scale Mandarin Telephone Speech Corpus

The HKUST dataset is a large dataset of speech recordings, each containing a single speaker speaking a sentence.

The Wall Street Journal Corpus

The WSJ dataset is a large dataset of speech recordings, each containing a single speaker speaking a sentence.

TIMIT Acoustic-Phonetic Continuous Speech Corpus

The TIMIT acoustic-phonetic continuous speech corpus CD-ROM contains a large collection of speech samples from 250 male and 250 female speakers.

Voice Bank speech corpus

This dataset is a selection of ten British English speakers – both male and female – from the Voice Bank speech corpus, each of whom has around 400 clean...

Chinese Standard Mandarin Speech Corpus (CSMSC)

The Chinese Standard Mandarin Speech Corpus (CSMSC) is a large-scale speech corpus containing 10,000 recorded sentences read by a female speaker.

Librispeech

The Librispeech dataset is a large-scale speaker-dependent speech corpus containing 1080 hours of speech, 5600 utterances, and 1000 speakers.

24 datasets found