DGT corpus

This dataset is a parallel corpus of sentence-aligned text in nine languages (36 language pairs), drawn from the DGT corpus and used for language comparison experiments.

Cite this as

Blaž Škrlj, Senja Pollak (2024). Dataset: DGT corpus. https://doi.org/10.57702/gbwssekk

DOI retrieved: December 3, 2024

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Defined In https://doi.org/10.1007/978-3-030-31372-2_10
Author Blaž Škrlj
More Authors Senja Pollak
Homepage https://opus.jrc.ec.europa.eu/en/previous-projects/opus-corpus.html