Speech Enhancement - Groups

VCTK Corpus

The VCTK corpus is an English multi-speaker dataset, with 44 hours of audio spoken by 109 native English speakers.

Using power level difference for near field dual-microphone speech enhancement

Custom Dataset

The authors created a custom dataset for their experiment, consisting of 33,000 images of 320 possible object-image combinations, with 10 possible shapes, 8 possible colors, 2...

VoiceBank-DEMAND

The VoiceBank-DEMAND dataset is a standard benchmark for speech denoising systems. It consists of 28 speakers at four signal-to-noise ratios (SNRs) of 15, 10, 5, and 0 dB and...

Diffusion-based speech enhancement with a weighted generative-supervised learning loss

DNS Blind Test Set

The Deep Noise Suppression (DNS) challenge provides a blind test set for both non-personalized and personalized DNS models.

DNS Training Datasets

The DNS challenge provides clean speech, noise, impulse responses, and a training data synthesizer for both non-personalized and personalized DNS models.

