You're currently viewing an old version of this dataset. To see the current version, click here.

WMT14 Dataset

The dataset consists of translations produced by a state-of-the-art neural machine translation (NMT) Transformer model. It follows the WMT14 data setup, optimized on the test set of WMT13.

Data and Resources

Cite this as

Ondřej Bojar, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Philipp Koehn, Christof Monz (2024). Dataset: WMT14 Dataset. https://doi.org/10.57702/ccf3vvu2

DOI retrieved: November 25, 2024

Additional Info

Field Value
Created November 25, 2024
Last update November 25, 2024
Defined In https://doi.org/10.48550/arXiv.1903.12017
Author Ondřej Bojar
More Authors
Christian Federmann
Mark Fishel
Yvette Graham
Barry Haddow
Matthias Huck
Philipp Koehn
Christof Monz
Homepage https://www.statmt.org/wmt14/translation-task.html