-
WMT2014 German-English Translation Task
The dataset used in this paper is the WMT2014 German-English translation task, which consists of 4.51M parallel sentence pairs. -
De-En: German-English dataset
Four different language pairs have been selected for the experiments. The datasets' size varies from tens of thousands to millions of sentences to test the regularizers' ability... -
IWSLT14 German-English
Diffusion models have achieved state-of-the-art synthesis quality on both visual and audio tasks, and recent works further adapt them to textual data by diffusing on the... -
IWSLT'14 German-English Translation Dataset
The dataset contains 160K sentence pairs for German-English translation.