WebText

WebText is a text corpus widely used for natural language processing tasks, and it is the dataset used in the paper referenced under "Defined In" below.

Cite this as

Wenhong Zhu, Hongkun Hao, Rui Wang (2024). Dataset: WebText. https://doi.org/10.57702/w4ewuze3

DOI retrieved: December 17, 2024

Additional Info

Field Value
Created December 17, 2024
Last update December 17, 2024
Defined In https://doi.org/10.48550/arXiv.2310.14971
Author Wenhong Zhu
More Authors Hongkun Hao, Rui Wang
Homepage https://huggingface.co/datasets/webtext