Dataset - LDM

Constrained Neural Language Generation

The dataset used in this paper for constrained neural language generation.
- Dataset
- JSON
Chinese Corpus

The dataset is used to analyze corpora in a completely language independent and unsupervised way without any prior linguistic knowledge.
- Dataset
- JSON
Accountant Corpus

The dataset is used to analyze corpora in a completely language independent and unsupervised way without any prior linguistic knowledge.
- Dataset
- JSON
Medline Corpus

The dataset is used to analyze corpora in a completely language independent and unsupervised way without any prior linguistic knowledge.
- Dataset
- JSON
Wittgenstein Corpus

The dataset is used to analyze corpora in a completely language independent and unsupervised way without any prior linguistic knowledge.
- Dataset
- JSON
EU-Parliament Corpus

The dataset is used to analyze corpora in a completely language independent and unsupervised way without any prior linguistic knowledge.
- Dataset
- JSON
Long-Short Transformer

The Long-Short Transformer dataset is a dataset for language and vision.
- Dataset
- JSON
Wikipedia Corpus

The dataset used in the paper is a subset of the Wikipedia corpus, consisting of 7500 English Wikipedia articles belonging to one of the following categories: People, Cities,...
- Dataset
- JSON
Bengali Hate Speech Dataset

The Bengali Hate Speech Dataset is a large-scale dataset for hate speech detection in the Bengali language. It contains 8,087 labelled examples, categorized into political,...
- Dataset
- JSON
MSR-VTT

The dataset used in the paper is MSR-VTT, a large video description dataset for bridging video and language. The dataset contains 10k video clips with length varying from 10 to...
- Dataset
- JSON
Cityscapes

The Cityscapes dataset is a large and famous city street scene semantic segmentation dataset. 19 classes of which 30 classes of this dataset are considered for training and...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

11 datasets found