-
VCTK Corpus
The VCTK corpus is an English multi-speaker dataset, with 44 hours of audio spoken by 109 native English speakers. -
Using power level difference for near field dual-microphone speech enhancement
Using power level difference for near field dual-microphone speech enhancement -
Custom Dataset
The authors created a custom dataset for their experiment, consisting of 33,000 images of 320 possible object-image combinations, with 10 possible shapes, 8 possible colors, 2... -
VoiceBank-DEMAND
The VoiceBank-DEMAND dataset is a standard benchmark for speech denoising systems. It consists of 28 speakers with 4 signal-to-noise ratios (SNR) (15, 10, 5, and 0 dB) and... -
Diffusion-based speech enhancement with a weighted generative-supervised lear...
Diffusion-based speech enhancement with a weighted generative-supervised learning loss -
DNS Blind Test Set
The DNS challenge provides a blind test set for both non-personalized and personalized DNS models. -
DNS Training Datasets
The DNS challenge provides clean speech, noise, impulse responses, and a training data synthesizer for both non-personalized and personalized DNS models.