11 datasets found

Tags: Language

Filter Results
  • Constrained Neural Language Generation

    The dataset used in this paper for constrained neural language generation.
  • Chinese Corpus

    The dataset is used to analyze corpora in a completely language independent and unsupervised way without any prior linguistic knowledge.
  • Accountant Corpus

    The dataset is used to analyze corpora in a completely language independent and unsupervised way without any prior linguistic knowledge.
  • Medline Corpus

    The dataset is used to analyze corpora in a completely language independent and unsupervised way without any prior linguistic knowledge.
  • Wittgenstein Corpus

    The dataset is used to analyze corpora in a completely language independent and unsupervised way without any prior linguistic knowledge.
  • EU-Parliament Corpus

    The dataset is used to analyze corpora in a completely language independent and unsupervised way without any prior linguistic knowledge.
  • Long-Short Transformer

    The Long-Short Transformer dataset is a dataset for language and vision.
  • Wikipedia Corpus

    The dataset used in the paper is a subset of the Wikipedia corpus, consisting of 7500 English Wikipedia articles belonging to one of the following categories: People, Cities,...
  • Bengali Hate Speech Dataset

    The Bengali Hate Speech Dataset is a large-scale dataset for hate speech detection in the Bengali language. It contains 8,087 labelled examples, categorized into political,...
  • MSR-VTT

    The dataset used in the paper is MSR-VTT, a large video description dataset for bridging video and language. The dataset contains 10k video clips with length varying from 10 to...
  • Cityscapes

    The Cityscapes dataset is a large and famous city street scene semantic segmentation dataset. 19 classes of which 30 classes of this dataset are considered for training and...
You can also access this registry using the API (see API Docs).