-
Decimal Addition Dataset
The dataset used in this paper is a collection of decimal addition tasks, where the input lengths range from 1 to 40 digits. The dataset is used to evaluate the ability of... -
Automated discovery of mathematical definitions in text
Automated discovery of mathematical definitions in text. -
Proof-Pile-2
The dataset used for continual pre-training of large language models, with a focus on balancing the text distribution and mitigating overfitting.