4 datasets found

Tags: speech processing

Filter Results
  • speechocean762

    speechocean762: An open-source non-native English speech corpus for pronunciation assessment.
  • Automatic Pronunciation Assessment

    A hierarchical context-aware modeling approach for multi-aspect and multi-granular pronunciation assessment
  • WHAM!

    The WHAM! dataset is used for testing the proposed Bayesian factorised speaker-environment adaptive training and test time adaptation approach for Conformer models.
  • LibriTTS

    A popular text-based VC approach is to use an automatic speech recognition (ASR) model to extract phonetic posteriorgram (PPG) as content representation.
You can also access this registry using the API (see API Docs).