-
Semantic Scholar Open Research Corpus
The Semantic Scholar Open Research Corpus contains meta-data of 46,947,044 published research papers in Computer Science, Neuroscience, and Bio-medicine from 1936 to 2019. -
CitHepPh Dataset
The CitHepPh dataset is a citation graph of 34,546 papers with 421,578 edges during the period from January 1993 to April 2002. -
PapersWithCode dataset
The dataset used in the paper is the PapersWithCode dataset, which contains papers and code from various machine learning conferences. -
Elsevier OA CC-BY corpus
The Elsevier OA CC-BY corpus dataset consists of 40,000 open-access articles from across Elsevier's journals, representing a diverse research discipline.