-
Hebrew Names Transliterated in English
The dataset used for training the language identification model, containing 16,500 Hebrew names transliterated in English, 3,600 Arabic names transliterated in English, and... -
Microblog language identification
Twitter language identification dataset -
SwissText & KONVENS 2020 shared task 2
Twitter language identification dataset for Swiss German dialect