- Wikitext-103 and LAMBADA datasets: The paper does not explicitly name a single dataset, but it states that the authors trained a GPT-2 transformer language model on the Wikitext-103 and LAMBADA datasets.
- WikiText-103 dataset: The dataset used in this paper is WikiText-103, a large corpus of text.