-
TSP speech database
The TSP speech database is a dataset of speech recordings. -
Isolet dataset
The dataset used in this paper is the Isolet dataset, which contains 4,000 13-channel audio recordings of 100 speakers. -
Freesound Dataset
The Freesound dataset consists of 18,873 audio files, each assigned one of the 41 unique audio events from the Google's Audioset Ontology. -
Semi Supervised Learning for Few-Shot Audio Classification by Episodic Triple...
Few-shot learning aims to generalize unseen classes that appear during testing but are unavailable during training. The performance of prototypical networks in extreme few-shot... -
COVID-19 Identification ResNet (CIdeR)
The COVID-19 Identification ResNet (CIdeR) dataset consists of 517 crowdsourced coughing and breathing audio recordings from 355 participants, of which 62 participants had tested... -
VoiceBank DEMAND dataset
Speech enhancement dataset -
TIMIT dataset
The dataset used in this paper is a collection of phonetically and phonologically local allophonic distribution in English, where voiceless stops surface as aspirated... -
Schizophrenia Spectrum Dataset
The dataset used for this study was collected for a mental health assessment project conducted at the University of Maryland School of Medicine in collaboration with the... -
LJSpeech-1.1
The LJSpeech-1.1 dataset is a large-scale speech dataset containing approximately 24 hours of single-speaker speech recorded at 22 050 Hz.