-
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
MaX-DeepLab: End-to-end panoptic segmentation with mask transformers -
FastSpeech 2: Fast and High-Quality End-to-End Text-to-Speech
FastSpeech 2 is a fast and high-quality end-to-end text-to-speech system. It uses a multi-task learning approach to learn the mapping between phonemes and waveforms.