StrategyQA

StrategyQA is a benchmark dataset used to evaluate how accurately LLMs answer questions that require multi-step reasoning.
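As a rough illustration of what such an accuracy evaluation might look like, the sketch below scores free-form model replies against yes/no gold labels. It assumes that answers are boolean and that a reply begins with "yes" or "no". The function names `parse_yes_no` and `accuracy` are hypothetical and are not part of any StrategyQA tooling, and the sample replies are invented for the example.

```python
from typing import List, Optional


def parse_yes_no(text: str) -> Optional[bool]:
    """Map a free-form reply to True/False, or None if it is unclear.

    Assumption: the reply starts with "yes"/"no" (or "true"/"false").
    """
    words = text.strip().lower().replace(",", " ").replace(".", " ").split()
    if not words:
        return None
    if words[0] in ("yes", "true"):
        return True
    if words[0] in ("no", "false"):
        return False
    return None


def accuracy(replies: List[str], gold: List[bool]) -> float:
    """Fraction of replies whose parsed answer matches the gold label.

    Replies that cannot be parsed count as incorrect.
    """
    if len(replies) != len(gold):
        raise ValueError("replies and gold must have the same length")
    if not gold:
        return 0.0
    correct = sum(parse_yes_no(r) == g for r, g in zip(replies, gold))
    return correct / len(gold)


# Invented replies, for illustration only (not taken from the dataset).
sample_replies = ["Yes, because the first step holds.", "no", "unsure"]
sample_gold = [True, False, True]
score = accuracy(sample_replies, sample_gold)
```

Counting unparseable replies as wrong is one possible design choice; an evaluation could instead report them separately.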

Cite this as

Mor Geva, Daniel Khashabi, Elad Segal, Tushar Khot, Dan Roth, Jonathan Berant (2025). Dataset: StrategyQA. https://doi.org/10.57702/nkmvf3lj

DOI retrieved: January 2, 2025

Additional Info

Created: January 2, 2025
Last update: January 2, 2025
Defined In: https://doi.org/10.48550/arXiv.2302.12246
Citation:
  • https://doi.org/10.48550/arXiv.2406.13929
  • https://doi.org/10.48550/arXiv.2212.09656
  • https://doi.org/10.48550/arXiv.2403.01390
  • https://doi.org/10.48550/arXiv.2402.10612
  • https://doi.org/10.48550/arXiv.2402.18678
  • https://doi.org/10.48550/arXiv.2309.13075
Author: Mor Geva
More Authors:
  • Daniel Khashabi
  • Elad Segal
  • Tushar Khot
  • Dan Roth
  • Jonathan Berant
Homepage: https://huggingface.co/datasets/metaeval/strategy-qa