Datasets Activity Stream About Order by Relevance Name Ascending Name Descending Last Modified Go 2 datasets found Groups: Machine Translation Filter Results Vakyansh The dataset is used for training and testing the proposed punctuation restoration and inverse text normalization models. Dataset JSON Penn Treebank The Penn Treebank dataset contains one million words of 1989 Wall Street Journal material annotated in Treebank II style, with 42k sentences of varying lengths. Dataset JSON