3 datasets found

Tags: Mathematical Reasoning

Filter Results
  • Qiyas Benchmark

    The Qiyas benchmark is a standardized General Aptitude Test (GAT) used for university admissions in Saudi Arabia, ensuring its quality and relevance to real-world assessment. It...
  • GSM8K

    Mathematical reasoning tasks involve mapping a question into a series of equations, which are then solved to obtain the final answer.
  • MathQA

    MathQA is an English mathematical problems dataset at GRE level. The original MathQA dataset is annotated in a different way from Math23k with many pre-defined operations.
You can also access this registry using the API (see API Docs).