BOFFIN TTS: Few-shot Speaker Adaptation by Bayesian Optimization
BOFFIN TTS is a novel approach for few-shot speaker adaptation. The task is to fine-tune a pre-trained TTS model to mimic a new speaker using a small corpus of target utterances.
BibTex: