-
Character Transformations for Non-Autoregressive GEC Tagging
Propose character-based method to generate target transformation instructions for GEC tagging models, as an alternative to autoregressive models. Compare character... -
Adult and German
The dataset used in the paper is a classification dataset, specifically Adult and German. -
Grascco - the first publicly shared, multiply-alienated German clinical text ...
Annotated corpus of textual explanations for clinical decision support -
A Database of German Emotional Speech
EMO-DB is a speech emotion database containing 10 actors speaking 10 sentences in German with archetypical emotions. -
PADT, EWT, GSD, HDT, and SynTagRus
PADT, EWT, GSD, HDT, and SynTagRus are UD treebanks. -
WMT 2014 English-German task
The dataset used for the Second Workshop on Neural Machine Translation and Generation -
A New Aligned Simple German Corpus
A new sentence-aligned monolingual corpus for Simple German – German. It contains multiple document-aligned sources which we have aligned using automatic sentence-alignment... -
IWSLT 2014
The IWSLT 2014 German-to-English dataset is a machine translation dataset, containing 153K sentence pairs.