-
Wikitext-103 and MusDB datasets
The paper does not explicitly name its dataset, but it states that the authors trained a 16-layer transformer (Vaswani et al., 2017) based language model on... -
ANALYSING DISCRETE SELF SUPERVISED SPEECH REPRESENTATION FOR SPOKEN LANGUAGE ...
This work profoundly analyzes discrete self-supervised speech representations (units) through the eyes of Generative Spoken Language Modeling (GSLM). -
speechocean762
speechocean762: An open-source non-native English speech corpus for pronunciation assessment. -
Automatic Pronunciation Assessment
A hierarchical context-aware modeling approach for multi-aspect and multi-granular pronunciation assessment