You're currently viewing an old version of this dataset. To see the current version, click here.

arXiv dataset

The dataset used in this paper is a collection of arXiv papers in English, filtered to include only those written in English, with LATEX source available, compilable on a modern LATEX distribution, and containing at least a theorem or a proof environment.

Data and Resources

Cite this as

Shrey Mishra, Antoine Gauquier, Pierre Senellart (2024). Dataset: arXiv dataset. https://doi.org/10.57702/ep6960zn

DOI retrieved: December 3, 2024

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Author Shrey Mishra
More Authors
Antoine Gauquier
Pierre Senellart
Homepage https://github.com/mv96/mm_extraction