24 datasets found

Groups: Mathematics

Filter Results
  • Decimal Addition Dataset

    The dataset used in this paper is a collection of decimal addition tasks, where the input lengths range from 1 to 40 digits. The dataset is used to evaluate the ability of...
  • Bounded Idempotent Pocrims

    The dataset used in the paper is a collection of bounded idempotent pocrims, hoops, and coops, which are used to model the semantics of continuous logic.
  • Bounded Involutive Pocrims

    The dataset used in the paper is a collection of bounded involutive pocrims, hoops, and coops, which are used to model the semantics of continuous logic.
  • Continuous Logic

    The dataset used in the paper is a collection of pocrims, hoops, and coops, which are used to model the semantics of continuous logic.
  • Machine Number Sense

    A comprehensive indicator of mathematical thinking and intelligence, the number sense bridges the induction of symbolic concepts and the competence of problem-solving.
  • MathMLBen

    The MathMLBen dataset is used to evaluate the performance of formula embedding techniques for mathematical information retrieval.
  • arXMLiv 2018

    The arXMLiv 2018 dataset is an HTML collection of the arXiv.org preprint archive, used as a training corpus for word embedding techniques.
  • Dataset for Implicit Automated Assessment of Mathematical Short Answer Items

    The dataset used for this task was derived as a supplementary dataset provided in connection with a national assessment program. It was administered alongside a short answer...
  • Math

    The dataset used in the paper is a set of educational mathematics problems that reason about prime numbers, square numbers, and triangle numbers.
  • Sacrobosco Tables

    The Sacrobosco Tables dataset contains numerical tables from the Sacrobosco Collection, a corpus of 359 early modern printed editions of textbooks on astronomy used at European...
  • Dolanský and Dolanský (1952) Logarithmic Values

    Dolanský and Dolanský (1952) dataset contains data on logarithmic values for calculating binary or base two logarithms.
  • Automated discovery of mathematical definitions in text

    Automated discovery of mathematical definitions in text.
  • MuLiMa

    MuLiMa is a multilingual dictionary of mathematics curated by mathematicians.
  • MathGloss

    MathGloss is a project to create a knowledge graph (KG) for undergraduate mathematics from text, automatically, using modern natural language processing (NLP) tools and...
  • Assistments

    Assistments is an electronic tutor that teaches and evaluates students in grade-school math.
  • Proof-Pile-2

    The dataset used for continual pre-training of large language models, with a focus on balancing the text distribution and mitigating overfitting.
  • DeepMind Mathematics Dataset

    The DeepMind Mathematics Dataset consists of synthetically generated math problems. They cover a range of problem types including: Numbers, comparison, measurement, arithmetic,...
  • HOL Light and Flyspeck corpora

    The dataset consists of the core HOL Light corpus and the Flyspeck corpus, with millions of nodes representing atomic inferences.
  • COVID-19 dataset

    The dataset used in the paper is COVID-19 case data, state restriction policy, population and density, population with higher risk, age structure data, race structure data, and...
  • Math23k

    Math23k is the most commonly used Chinese dataset in MWP solving. It contains 23,162 problems with 21,162 training problems, 1,000 validation problems and 1,000 testing problems.
You can also access this registry using the API (see API Docs).