1 dataset found

Formats: JSON Tags: French biomedical data

Filter Results
  • OSCAR

    The OSCAR corpus is a multilingual web corpus that is used for pre-training large generative language models. It is a document-oriented corpus that is comparable in size and...
You can also access this registry using the API (see API Docs).