Dataset - LDM

BBC-M5

The dataset used in the paper is a collection of documents for document semantics comparison and clustering tasks.
- Dataset
- JSON

Reuters-M7

The dataset used in the paper is a collection of documents for document semantics comparison and clustering tasks.
- Dataset
- JSON

20News-C10

The dataset used in the paper is a collection of documents for document semantics comparison and clustering tasks.
- Dataset
- JSON

20News-M5

The dataset used in the paper is a collection of documents for document semantics comparison and clustering tasks.
- Dataset
- JSON

STS2017

The dataset used in the paper is a collection of documents for document semantics comparison and clustering tasks.
- Dataset
- JSON

Li30

The dataset used in the paper is a collection of documents for document semantics comparison and clustering tasks.
- Dataset
- JSON

Lee60

The dataset used in the paper is a collection of documents for document semantics comparison and clustering tasks.
- Dataset
- JSON

SICK

SICK is a dataset for recognizing textual entailment (RTE), containing 4.5K/0.5K/5.0K train/dev/test examples. Each example consists of a hypothesis and a premise, and the goal...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

8 datasets found