S2ORC

A collection of 81.1 million English-language scholarly publications from various academic fields, used to pre-train a language model.
- Dataset
- JSON
