Wikitext-103

The dataset used in this paper is Wikitext-103, a general English language corpus containing good and featured Wikipedia articles.

Data and Resources

Cite this as

James Henderson, Fabio Fehr (2024). Dataset: Wikitext-103. https://doi.org/10.57702/b35ezmet

DOI retrieved: November 25, 2024

Additional Info

Field Value
Created November 25, 2024
Last update December 2, 2024
Defined In https://doi.org/10.48550/arXiv.2205.02517
Citation
  • https://doi.org/10.48550/arXiv.2203.06298
Author James Henderson
More Authors
Fabio Fehr
Homepage https://huggingface.co/datasets/wikitext-103