BOFFIN TTS: Few-shot Speaker Adaptation by Bayesian Optimization

doi:doi:10.57702/c4k21kl5

BOFFIN TTS: Few-shot Speaker Adaptation by Bayesian Optimization

BOFFIN TTS is a novel approach for few-shot speaker adaptation. The task is to fine-tune a pre-trained TTS model to mimic a new speaker using a small corpus of target utterances.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Henry B. Moss, Vatsal Aggarwal, Nishant Prateek, Javier González, Roberto Barra-Chicote (2024). Dataset: BOFFIN TTS: Few-shot Speaker Adaptation by Bayesian Optimization. https://doi.org/10.57702/c4k21kl5

DOI retrieved: December 3, 2024

Additional Info

Field	Value
Created	December 3, 2024
Last update	December 3, 2024
Defined In	https://doi.org/10.48550/arXiv.2002.01953
Author	Henry B. Moss
More Authors	Vatsal Aggarwal Nishant Prateek Javier González Roberto Barra-Chicote