- Squared Curvature Regularization
The dataset used in the paper for testing the proposed squared curvature regularization approach.
- Penn Treebank Character
The Penn Treebank Character dataset is a character-level language modeling dataset.
- Improved Language Modeling by Decoding the Past
Highly regularized LSTMs achieve impressive results on several benchmark datasets in language modeling. We propose a new regularization method based on decoding the last token...
- Reg-mixup: Mixup as a regularizer can surprisingly improve accuracy and out d...
Mixup as a regularizer can surprisingly improve accuracy and out distribution robustness.
- Penn Treebank
The Penn Treebank dataset contains one million words of 1989 Wall Street Journal material annotated in Treebank II style, with 42k sentences of varying lengths.