You're currently viewing an old version of this dataset. To see the current version, click here.

MGSM

The MGSM dataset is a multilingual math reasoning dataset containing around 7,500 training samples and 1,319 testing samples.

Data and Resources

This dataset has no data

Cite this as

Rohan Anil, Andrew M. Dai, Orhan Firat, Melvin Johnson, Dmitry Lepikhin, Alexandre Passos, Siamak Shakeri, Emanuel Taropa (2025). Dataset: MGSM. https://doi.org/10.57702/f17ipaia

Private DOI This DOI is not yet resolvable.
It is available for use in manuscripts, and will be published when the Dataset is made public.

Additional Info

Field Value
Created January 3, 2025
Last update January 3, 2025
Defined In https://doi.org/10.48550/arXiv.2406.02301
Author Rohan Anil
More Authors
Andrew M. Dai
Orhan Firat
Melvin Johnson
Dmitry Lepikhin
Alexandre Passos
Siamak Shakeri
Emanuel Taropa