DGT corpus

The dataset is a parallel corpus of aligned sentences across nine languages (36 language pairs) from the DGT corpus, used for language comparison experiments.

BibTex: