StrategyQA


The StrategyQA dataset is used to evaluate how accurately LLMs answer questions that require multi-step reasoning.

Data and Resources

Original Metadata (JSON)
The json representation of the dataset with its distributions based on DCAT.

Mor Geva, Daniel Khashabi, Elad Segal, Tushar Khot, Dan Roth, Jonathan Berant (2025). Dataset: StrategyQA. https://doi.org/10.57702/nkmvf3lj

DOI retrieved: January 2, 2025

Field	Value
Created	January 2, 2025
Last update	January 2, 2025
Defined In	https://doi.org/10.48550/arXiv.2302.12246
Citation	https://doi.org/10.48550/arXiv.2406.13929 https://doi.org/10.48550/arXiv.2212.09656 https://doi.org/10.48550/arXiv.2403.01390 https://doi.org/10.48550/arXiv.2402.10612 https://doi.org/10.48550/arXiv.2402.18678 https://doi.org/10.48550/arXiv.2309.13075
Author	Mor Geva
More Authors	Daniel Khashabi Elad Segal Tushar Khot Dan Roth Jonathan Berant
Homepage	https://huggingface.co/datasets/metaeval/strategy-qa