WSJ0-2mix

The dataset used in the paper is the WSJ0-2mix dataset, which contains 30 hours of training data and 10 hours of validation data generated from the WSJ0 dataset. The speech utterances were mixed up at various signal-to-noise ratio (SNR) levels between -5 dB to 5 dB.

BibTex: