- Hindi-English Code-Switched Sentences
  The dataset used in the paper is a collection of Hindi-English code-switched sentences.
- ArzEnSEG corpus
  The ArzEnSEG corpus is a morphologically annotated dataset for code-switched Egyptian Arabic-English.
- ArzEn parallel corpus
  The ArzEn parallel corpus consists of speech transcriptions gathered through informal interviews with bilingual Egyptian Arabic-English speakers, as well as their English...
- SEAME corpus
  SEAME corpus is a Mandarin-English code-switching speech corpus.
- ASRU 2019 Mandarin-English code-switching speech recognition challenge
  The ASRU 2019 Mandarin-English code-switching speech recognition challenge dataset.