Measuring Massive Multitask Language Understanding

The dataset used in this paper is a multiple choice question set that allows for the evaluation of large language models.

Data and Resources

Cite this as

Reid McIlroy-Young, Katrina Brown, Conlan Olson, Linjun Zhang, Cynthia Dwork (2024). Dataset: Measuring Massive Multitask Language Understanding. https://doi.org/10.57702/qxktk0p2

DOI retrieved: December 17, 2024

Additional Info

Field Value
Created December 17, 2024
Last update December 17, 2024
Defined In https://doi.org/10.48550/arXiv.2305.14325
Citation
  • https://doi.org/10.48550/arXiv.2406.06581
Author Reid McIlroy-Young
More Authors
Katrina Brown
Conlan Olson
Linjun Zhang
Cynthia Dwork
Homepage https://arxiv.org/abs/2009.14552