-
Semi Supervised Learning for Few-Shot Audio Classification by Episodic Triple...
Few-shot learning aims to generalize unseen classes that appear during testing but are unavailable during training. The performance of prototypical networks in extreme few-shot... -
COVID-19 Cough Sub-Challenge
The dataset is used for automatic diagnosis of Covid-19 from crowdsourced respiratory sound data. -
Primate Vocalisations Corpus
The dataset is used for automated species classification of primates. -
Audio Set: An ontology and human-labeled dataset for audio events
The authors used the AudioSet dataset for testing their models. -
Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers
The Audio Spectrogram Transformer (AST) model is used for audio classification tasks. -
TAU Urban Acoustic Scenes 2019
The dataset used for acoustic scene classification task. -
DCASE 2019
The dataset used for acoustic scene classification, sound event detection and image classification tasks. -
VoxCeleb dataset
The VoxCeleb dataset is a large-scale speaker identification dataset, used to evaluate the performance of face recognition systems. -
SemanticAC: SEMANTICS-ASSISTED FRAMEWORK FOR AUDIO CLASSIFICATION
A semantics-assisted framework for audio classification to better leverage the semantic information. -
Speech Commands Dataset
The dataset used for training the keyword spotting model is the ESC: Dataset for Environmental Sound Classification, and the Speech Commands Dataset. -
Speech Commands
The Speech Commands dataset consists of 105809 one-second audio recordings of 35 spoken words sampled at 16kHz. The raw speech commands dataset presents audio recordings as a...