Datasets Activity Stream About Order by Relevance Name Ascending Name Descending Last Modified Go 3 datasets found Filter Results PADIC Machine translation experiments on PADIC: A parallel Arabic dialect corpus Dataset JSON Deutsches Textarchiv A diachronic corpus of German. Dataset JSON Penn Treebank The Penn Treebank dataset contains one million words of 1989 Wall Street Journal material annotated in Treebank II style, with 42k sentences of varying lengths. Dataset JSON