Datasets Activity Stream About Order by Relevance Name Ascending Name Descending Last Modified Go 1 dataset found Groups: Materials Science Formats: JSON Filter Results S2ORC A collection of 81.1 million scholarly publications in English from various academic fields, used to pre-train a language model. Dataset JSON