-
Audio Data
The dataset contains audio data from various sources, including podcasts, audiobooks, and voice assistants. -
TIMIT Acoustic-Phonetic Continuous Speech Corpus
The TIMIT acoustic-phonetic continuous speech corpusCD-ROM contains a large collection of speech samples from 250 male and 250 female speakers. -
The LJ Speech Dataset
The LJ-Speech dataset is a dataset of speech recordings of a female speaker. -
Librispeech
The Librispeech dataset is a large-scale speaker-dependent speech corpus containing 1080 hours of speech, 5600 utterances, and 1000 speakers.