BOFFIN TTS: Few-shot Speaker Adaptation by Bayesian Optimization

BOFFIN TTS (Bayesian Optimization For FIne-tuning Neural Text To Speech) is an approach to few-shot speaker adaptation. The task is to fine-tune a pre-trained text-to-speech (TTS) model so that it mimics a new speaker, using only a small corpus of that speaker's utterances.
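
This record does not describe the optimization procedure itself. As a rough illustration of the general idea, the sketch below runs a small Bayesian optimization loop over a single fine-tuning hyperparameter. Everything in it is an assumption made for illustration: the search space (log10 of the learning rate), the RBF-kernel Gaussian process, the lower-confidence-bound acquisition rule, and the hypothetical `toy_validation_loss` function that stands in for "fine-tune the model, then score it on held-out target utterances". It is not the authors' implementation.

```python
import math

def rbf(a, b, length=0.5):
    """Squared-exponential kernel between two scalars."""
    return math.exp(-0.5 * ((a - b) / length) ** 2)

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = s / M[i][i]
    return x

def gp_posterior(xs, ys, x_new, noise=1e-4):
    """Posterior mean and standard deviation of a GP at x_new."""
    mean_y = sum(ys) / len(ys)
    K = [[rbf(a, b) + (noise if i == j else 0.0)
          for j, b in enumerate(xs)] for i, a in enumerate(xs)]
    k_star = [rbf(a, x_new) for a in xs]
    alpha = solve(K, [y - mean_y for y in ys])
    v = solve(K, k_star)
    mu = mean_y + sum(k * a for k, a in zip(k_star, alpha))
    var = max(1.0 - sum(k * w for k, w in zip(k_star, v)), 0.0)
    return mu, math.sqrt(var)

def toy_validation_loss(log_lr):
    # Hypothetical stand-in for an expensive fine-tuning run;
    # a real objective would train and evaluate a TTS model.
    return (log_lr + 3.4) ** 2

def bayes_opt(objective, lo, hi, n_init=3, n_iter=10, kappa=2.0):
    """Minimize `objective` on [lo, hi] with a GP and an LCB acquisition."""
    xs = [lo + (hi - lo) * i / (n_init - 1) for i in range(n_init)]
    ys = [objective(x) for x in xs]
    grid = [lo + (hi - lo) * i / 200 for i in range(201)]
    for _ in range(n_iter):
        # Skip grid points that were already evaluated.
        candidates = [g for g in grid if all(abs(g - x) > 1e-9 for x in xs)]

        def lcb(g):
            mu, sd = gp_posterior(xs, ys, g)
            return mu - kappa * sd  # low mean or high uncertainty wins

        x_next = min(candidates, key=lcb)
        xs.append(x_next)
        ys.append(objective(x_next))
    best = min(range(len(xs)), key=lambda i: ys[i])
    return xs[best], ys[best]

best_log_lr, best_loss = bayes_opt(toy_validation_loss, -5.0, -1.0)
print(f"best log10(lr) = {best_log_lr:.2f}, loss = {best_loss:.4f}")
```

The point of the loop is that each new evaluation is chosen where the surrogate model predicts either a low loss or high uncertainty, so few expensive fine-tuning runs are needed. A real setup would typically search several hyperparameters jointly and use a maintained Gaussian process library in place of the hand-written solver.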

Cite this as

Henry B. Moss, Vatsal Aggarwal, Nishant Prateek, Javier González, Roberto Barra-Chicote (2024). Dataset: BOFFIN TTS: Few-shot Speaker Adaptation by Bayesian Optimization. https://doi.org/10.57702/c4k21kl5

DOI retrieved: December 3, 2024

Additional Info

Field          Value
Created        December 3, 2024
Last update    December 3, 2024
Defined In     https://doi.org/10.48550/arXiv.2002.01953
Author         Henry B. Moss
More Authors   Vatsal Aggarwal, Nishant Prateek, Javier González, Roberto Barra-Chicote