-
Semi-supervised speaker adaptation for end-to-end speech synthesis
A dataset for semi-supervised speaker adaptation for end-to-end speech synthesis with pre-trained models. -
BOFFIN TTS: Few-shot Speaker Adaptation by Bayesian Optimization
BOFFIN TTS is a novel approach for few-shot speaker adaptation. The task is to fine-tune a pre-trained TTS model to mimic a new speaker using a small corpus of target utterances.