8 datasets found

Tags: document similarity

Filter Results
  • BBC-M5

    The dataset used in the paper is a collection of documents for document semantics comparison and clustering tasks.
  • Reuters-M7

    The dataset used in the paper is a collection of documents for document semantics comparison and clustering tasks.
  • 20News-C10

    The dataset used in the paper is a collection of documents for document semantics comparison and clustering tasks.
  • 20News-M5

    The dataset used in the paper is a collection of documents for document semantics comparison and clustering tasks.
  • STS2017

    The dataset used in the paper is a collection of documents for document semantics comparison and clustering tasks.
  • Li30

    The dataset used in the paper is a collection of documents for document semantics comparison and clustering tasks.
  • Lee60

    The dataset used in the paper is a collection of documents for document semantics comparison and clustering tasks.
  • SICK

    SICK is a dataset for recognizing textual entailment (RTE), containing 4.5K/0.5K/5.0K train/dev/test examples. Each example consists of a hypothesis and a premise, and the goal...
You can also access this registry using the API (see API Docs).