182 datasets found

Filter Results
  • Text Classification Datasets

    The dataset used in the paper is a collection of adversarial examples and natural examples for text classification tasks.
  • Shakespeare dataset

    Mobile crowdsensing has gained significant attention in recent years and has become a critical paradigm for emerging Internet of Things applications. The sensing devices...
  • TFDS

    Text dataset for text classification and sentiment analysis tasks.
  • 20Newsgroups dataset

    The 20Newsgroups data set is a dataset of 18,846 instances of newsgroup documents.
  • Autoencoder Trees

    The MNIST handwritten digit database and the 20Newsgroups data set are used to evaluate the proposed autoencoder tree model.
  • NYT

    Text summarization aims to extract essential information from a piece of text and transform the text into a concise version.
  • Reuters21578

    The problem of similarity search is to find the most similar items in a large collection to a query item of interest. Fast similarity search is at the core of many information...
  • AG News, SogouNews and DBpedia

    The AG News, SogouNews and DBpedia datasets are used for text classification experiments.
  • Amazon Reviews

    The Amazon Reviews dataset is used to predict the usefulness of Amazon reviews using off-the-shelf argumentation mining.
  • news20

    The news20 dataset is a multiclass text classification dataset.
  • sector

    The sector dataset is a multiclass text classification dataset.
  • rcv1

    The rcv1 dataset is a multiclass text classification dataset.
  • WebKB

    The dataset used in this paper is a probabilistic logic programming dataset, which is a probabilistic version of the WebKB dataset.
  • Reuters-8

    The Reuters-8 dataset is a collection of news articles from Reuters.
  • 20Newsgrp

    The 20Newsgrp dataset is a collection of news articles from 20 different newsgroups.
  • iPosts dataset

    The independently posted tweets dataset (henceforth: iPosts) that we used for contradiction detection between independently emerging claim-initiating tweets.
  • Threads RTE dataset

    The dataset on which the authors run disagreement reply detection (henceforth: Threads) was converted by us to RTE format based on the threaded conversations labeled in this...
  • Wikipedia Neutrality Corpus

    This dataset is used to test the ability of large language models to detect and correct biased Wikipedia edits according to Wikipedia's Neutral Point of View (NPOV) policy.
  • Yelp reviews polarity dataset

    Yelp reviews polarity dataset
  • News

    The News dataset consists of 5000 randomly sampled news articles from the NY Times corpus. It simulates the opinions of media consumers on news items. The units are different...