You're currently viewing an old version of this dataset. To see the current version, click here.

MS MARCO: A Human-Generated Machine Reading Comprehension Dataset

The dataset is used for training and evaluating the MS MARCO model, a question answering model.

Data and Resources

Cite this as

Payal Bajaj, Daniel Campos, Nick Craswell, Li Deng, Jianfeng Gao, Xiaodong Liu, Rangan Majumder, Bhaskar Mitra, Andrew McNamara, Mir Rosenberg, Tri Nguyen, Xia Song, Alina Stoica, Saurabh Tiwary, Tong Wang (2024). Dataset: MS MARCO: A Human-Generated Machine Reading Comprehension Dataset. https://doi.org/10.57702/lk331fw2

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Defined In https://doi.org/10.1145/3626772.3657931
Citation
  • https://doi.org/10.48550/arXiv.2404.06680
Author Payal Bajaj
More Authors
Daniel Campos
Nick Craswell
Li Deng
Jianfeng Gao
Xiaodong Liu
Rangan Majumder
Bhaskar Mitra
Andrew McNamara
Mir Rosenberg
Tri Nguyen
Xia Song
Alina Stoica
Saurabh Tiwary
Tong Wang
Homepage https://arxiv.org/abs/1705.07830