WikiText-103 and Enwik8 datasets

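WikiText-103 and Enwik8 are benchmark datasets for language modeling. WikiText-103 is commonly used for word-level modeling and Enwik8 for character-level (byte-level) modeling.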

Cite this as

Md Shamim Hussain, Mohammed J. Zaki, Dharmashankar Subramanian (2024). Dataset: WikiText-103 and Enwik8 datasets. https://doi.org/10.57702/m450bv0r

DOI retrieved: December 17, 2024

Additional Info

Field          Value
Created        December 17, 2024
Last update    December 17, 2024
Defined In     https://doi.org/10.1145/3580305.3599520
Author         Md Shamim Hussain
More Authors   Mohammed J. Zaki, Dharmashankar Subramanian
Homepage       https://huggingface.co/datasets/wikitext