-
BookCorpus
The dataset used in this paper for unsupervised sentence representation learning, consisting of paragraphs from unlabeled text. -
WikiText-103 dataset
The dataset used in this paper is the WikiText-103 dataset, which contains a large corpus of text. -
Training Transformers to Perform Tasks
A dataset for training transformers to perform tasks such as language translation and text generation.