
WMT14 Dataset

The dataset consists of translations produced by a state-of-the-art neural machine translation (NMT) Transformer model. The model follows the WMT14 data setup and was optimized on the WMT13 test set.

Data and Resources

This dataset has no data

Cite this as

Ondřej Bojar, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Philipp Koehn, Christof Monz (2024). Dataset: WMT14 Dataset. https://doi.org/10.57702/ccf3vvu2

Private DOI: This DOI is not yet resolvable. It is available for use in manuscripts and will be published when the dataset is made public.

Additional Info

Field          Value
Created        November 25, 2024
Last update    November 25, 2024
Defined In     https://doi.org/10.48550/arXiv.1903.12017
Author         Ondřej Bojar
More Authors   Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Philipp Koehn, Christof Monz
Homepage       https://www.statmt.org/wmt14/translation-task.html