1 dataset found

Formats: JSON
Tags: scientific text

  • S2ORC

    A collection of 81.1 million scholarly publications in English from various academic fields, used to pre-train a language model.

You can also access this registry through the API (see API Docs).