Language learning - Groups

Lang-84

The dataset used in this paper is a collection of parallel sentence pairs from 96 different native languages, with at least 10,000 sentence pairs per language.
- Dataset
- JSON
Cambridge First Certiﬁcate in English (FCE) dataset

The Cambridge First Certiﬁcate in English (FCE) dataset is used as the source of ESL data. The corpus is a subset of the Cambridge Learner Corpus (CLC) and contains English...
- Dataset
- JSON

2 datasets found