Code-switching - Groups

Hindi-English Code-Switched Sentences

The dataset used in the paper is a collection of Hindi-English code-switched sentences.
- Dataset
- JSON
ArzEnSEG corpus

The ArzEnSEG corpus is a morphologically annotated dataset for code-switched Egyptian Arabic-English.
- Dataset
- JSON
ArzEn parallel corpus

The ArzEn parallel corpus consists of speech transcriptions gathered through informal interviews with bilingual Egyptian Arabic-English speakers, as well as their English...
- Dataset
- JSON
SEAME corpus

SEAME corpus is a Mandarin-English code-switching speech corpus.
- Dataset
- JSON
ASRU 2019 Mandarin-English code-switching speech recognition challenge

The ASRU 2019 Mandarin-English code-switching speech recognition challenge dataset.
- Dataset
- JSON

5 datasets found