-
Freesound Dataset
The Freesound dataset consists of 18,873 audio files, each assigned one of the 41 unique audio events from the Google's Audioset Ontology. -
COVID-19 Identiļ¬cation ResNet (CIdeR)
The COVID-19 Identiļ¬cation ResNet (CIdeR) dataset consists of 517 crowdsourced coughing and breathing audio recordings from 355 participants, of which 62 participants had tested... -
VoiceBank DEMAND dataset
Speech enhancement dataset -
TIMIT dataset
The dataset used in this paper is a collection of phonetically and phonologically local allophonic distribution in English, where voiceless stops surface as aspirated... -
VCTK Corpus
The VCTK corpus is an English multi-speaker dataset, with 44 hours of audio spoken by 109 native English speakers. -
Librispeech
The Librispeech dataset is a large-scale speaker-dependent speech corpus containing 1080 hours of speech, 5600 utterances, and 1000 speakers.