Style Tokens

Global Style Tokens (GSTs) are a recently proposed method for learning disentangled latent representations of high-dimensional data. Used within Tacotron, a state-of-the-art end-to-end text-to-speech synthesis system, GSTs uncover expressive factors of variation in speaking style.

Data and Resources

Cite this as

Daisy Stanton, Yuxuan Wang, RJ Skerry-Ryan (2024). Dataset: Style Tokens. https://doi.org/10.57702/uon2i613

DOI retrieved: December 2, 2024

Additional Info

Field          Value
Created        December 2, 2024
Last update    December 2, 2024
Author         Daisy Stanton
More Authors   Yuxuan Wang; RJ Skerry-Ryan
Homepage       https://arxiv.org/abs/1803.09017