2 datasets found

Tags: Multilingual

Filter Results
  • OSCAR corpus

    The dataset used in this study is the OSCAR corpus, which is a multilingual corpus that is obtained by filtering of the Common Crawl corpus.
  • Parallel Meaning Bank

    A semantically annotated parallel corpus for English, German, Italian, and Dutch where sentences are aligned with scoped meaning representations in order to capture the...