-
Streaming end-to-end bilingual ASR systems with joint language identiļ¬cation
Multilingual ASR technology simplifies model training and deployment, but its accuracy is known to depend on the availability of language information at runtime. -
Hebrew Names Transliterated in English
The dataset used for training the language identification model, containing 16,500 Hebrew names transliterated in English, 3,600 Arabic names transliterated in English, and... -
Microblog language identification
Twitter language identification dataset -
SwissText & KONVENS 2020 shared task 2
Twitter language identification dataset for Swiss German dialect