-
Wall Street Journal
The Wall Street Journal dataset is used for syntactic linearization. It contains a large corpus of news articles with their corresponding syntactic trees. -
ASVspoof2019
The ASVspoof2019 LA subset consists of three parts, training, development, and evaluation. Each partition has a disjoint set of speakers. The average duration of the utterances... -
TIMIT dataset
The dataset used in this paper is a collection of phonetically and phonologically local allophonic distribution in English, where voiceless stops surface as aspirated... -
MSP-IMPROV
The MSP-IMPROV dataset contains 6 sessions of dyadic interactions between pairs of male-female actors. 15 target sentences are used to collect the recordings. For each target... -
Speech Commands
The Speech Commands dataset consists of 105809 one-second audio recordings of 35 spoken words sampled at 16kHz. The raw speech commands dataset presents audio recordings as a... -
LJSpeech Dataset
The LJSpeech dataset is a collection of audio recordings of a single female speaker reading aloud. -
LJ Speech Dataset
The LJ speech dataset is a dataset of speech samples recorded from a single speaker reading passages from 7 non-fiction books. -
LibriSpeech dataset
The dataset used in the paper is the LibriSpeech dataset, which contains about 1,000 hours of English speech derived from audiobooks.