-
Decimal Addition Dataset
The dataset used in this paper is a collection of decimal addition tasks, where the input lengths range from 1 to 40 digits. The dataset is used to evaluate the ability of... -
Bounded Idempotent Pocrims
The dataset used in the paper is a collection of bounded idempotent pocrims, hoops, and coops, which are used to model the semantics of continuous logic. -
Bounded Involutive Pocrims
The dataset used in the paper is a collection of bounded involutive pocrims, hoops, and coops, which are used to model the semantics of continuous logic. -
Continuous Logic
The dataset used in the paper is a collection of pocrims, hoops, and coops, which are used to model the semantics of continuous logic. -
Machine Number Sense
A comprehensive indicator of mathematical thinking and intelligence, the number sense bridges the induction of symbolic concepts and the competence of problem-solving. -
arXMLiv 2018
The arXMLiv 2018 dataset is an HTML collection of the arXiv.org preprint archive, used as a training corpus for word embedding techniques. -
Dataset for Implicit Automated Assessment of Mathematical Short Answer Items
The dataset used for this task was derived as a supplementary dataset provided in connection with a national assessment program. It was administered alongside a short answer... -
Sacrobosco Tables
The Sacrobosco Tables dataset contains numerical tables from the Sacrobosco Collection, a corpus of 359 early modern printed editions of textbooks on astronomy used at European... -
Dolanský and Dolanský (1952) Logarithmic Values
Dolanský and Dolanský (1952) dataset contains data on logarithmic values for calculating binary or base two logarithms. -
Automated discovery of mathematical definitions in text
Automated discovery of mathematical definitions in text. -
Assistments
Assistments is an electronic tutor that teaches and evaluates students in grade-school math. -
Proof-Pile-2
The dataset used for continual pre-training of large language models, with a focus on balancing the text distribution and mitigating overfitting. -
DeepMind Mathematics Dataset
The DeepMind Mathematics Dataset consists of synthetically generated math problems. They cover a range of problem types including: Numbers, comparison, measurement, arithmetic,... -
HOL Light and Flyspeck corpora
The dataset consists of the core HOL Light corpus and the Flyspeck corpus, with millions of nodes representing atomic inferences. -
COVID-19 dataset
The dataset used in the paper is COVID-19 case data, state restriction policy, population and density, population with higher risk, age structure data, race structure data, and...