-
Explainable Deep Clustering for Monaural Speech Separation
The proposed X-DC model uses a dataset of mixed speech signals of two, four, or eight speakers.
-
Continuous speech separation: Dataset and analysis
-
Generative Pre-Training for Speech
Generative models have gained increasing attention in recent years for their remarkable success in tasks that require estimating and sampling from a data distribution to generate...