SQuAD dataset

The corpus used to train BERT is a concatenation of Wikipedia and BooksCorpus; this entry focuses on the SQuAD task.

Cite this as

Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova (2024). Dataset: SQuAD dataset. https://doi.org/10.57702/g3jhe3wi

DOI retrieved: November 25, 2024

Additional Info

Created: November 25, 2024
Last update: November 25, 2024
Defined In: https://doi.org/10.48550/arXiv.1904.00962
Author: Jacob Devlin
More Authors: Ming-Wei Chang, Kenton Lee, Kristina Toutanova
Homepage: https://rajpurkar.github.io/SQuAD-explorer/