Text-to-Speech - Groups

FastSpeech: Fast, Robust and Controllable Text to Speech

Neural network based end-to-end text to speech (TTS) has signiﬁcantly improved the quality of synthesized speech. Prominent methods (e.g., Tacotron 2) usually ﬁrst generate...

Dataset
JSON

SNIPER Training: Single-Shot Sparse Training for Text-to-Speech

Text-to-speech (TTS) models have achieved remarkable naturalness in recent years, yet like most deep neural models, they have more parameters than necessary. Sparse TTS models...

Dataset
JSON

LibriTTS

A popular text-based VC approach is to use an automatic speech recognition (ASR) model to extract phonetic posteriorgram (PPG) as content representation.

Dataset
JSON

3 datasets found

FastSpeech: Fast, Robust and Controllable Text to Speech

SNIPER Training: Single-Shot Sparse Training for Text-to-Speech

LibriTTS