Audio Datasets - Groups

TIMIT dataset

The dataset used in this paper is a collection of phonetically and phonologically local allophonic distribution in English, where voiceless stops surface as aspirated...

Dataset
JSON

VCTK Dataset

The VCTK dataset is a large corpus of speech recordings, each containing a single speaker and a single sentence.

Dataset
JSON

LJSpeech Dataset

The LJSpeech dataset is a collection of audio recordings of a single female speaker reading aloud.

Dataset
JSON

LJSpeech and VCTK datasets

The LJSpeech dataset contains 13,100 22kHz audio clips of a female speaker. The VCTK dataset consists of 108 native English speakers with various accents.

Dataset
JSON

4 datasets found

TIMIT dataset

VCTK Dataset

LJSpeech Dataset

LJSpeech and VCTK datasets