You're currently viewing an old version of this dataset. To see the current version, click here.

StrategyQA

The StrategyQA dataset is used to evaluate the ability of LLMs in generating accurate answers to multi-step reasoning questions.

Data and Resources

This dataset has no data

Mor Geva, Daniel Khashabi, Elad Segal, Tushar Khot, Dan Roth, Jonathan Berant (2025). Dataset: StrategyQA. https://doi.org/10.57702/nkmvf3lj

Private DOI This DOI is not yet resolvable.
It is available for use in manuscripts, and will be published when the Dataset is made public.

Field	Value
Created	January 2, 2025
Last update	January 2, 2025
Defined In	https://doi.org/10.48550/arXiv.2302.12246
Citation	https://doi.org/10.48550/arXiv.2406.13929 https://doi.org/10.48550/arXiv.2212.09656 https://doi.org/10.48550/arXiv.2403.01390 https://doi.org/10.48550/arXiv.2402.10612 https://doi.org/10.48550/arXiv.2402.18678 https://doi.org/10.48550/arXiv.2309.13075
Author	Mor Geva
More Authors	Daniel Khashabi Elad Segal Tushar Khot Dan Roth Jonathan Berant
Homepage	https://huggingface.co/datasets/metaeval/strategy-qa