BooksCorpus

The BooksCorpus dataset consists of 11,038 books and has been used for text-only training.

Data and Resources

Cite this as

Zhu et al. (2024). Dataset: BooksCorpus. https://doi.org/10.57702/1bm89z4v

DOI retrieved: November 25, 2024

Additional Info

Field Value
Created November 25, 2024
Last update November 25, 2024
Defined In https://doi.org/10.48550/arXiv.1908.08530
Author Zhu et al.