Text Classification - Groups

Text Classification Datasets

The dataset used in the paper is a collection of adversarial examples and natural examples for text classification tasks.
- Dataset
- JSON
Shakespeare dataset

Mobile crowdsensing has gained significant attention in recent years and has become a critical paradigm for emerging Internet of Things applications. The sensing devices...
- Dataset
- JSON
TFDS

Text dataset for text classification and sentiment analysis tasks.
- Dataset
- JSON
20Newsgroups dataset

The 20Newsgroups data set is a dataset of 18,846 instances of newsgroup documents.
- Dataset
- JSON
Autoencoder Trees

The MNIST handwritten digit database and the 20Newsgroups data set are used to evaluate the proposed autoencoder tree model.
- Dataset
- JSON
NYT

Text summarization aims to extract essential information from a piece of text and transform the text into a concise version.
- Dataset
- JSON
Reuters21578

The problem of similarity search is to find the most similar items in a large collection to a query item of interest. Fast similarity search is at the core of many information...
- Dataset
- JSON
AG News, SogouNews and DBpedia

The AG News, SogouNews and DBpedia datasets are used for text classification experiments.
- Dataset
- JSON
Amazon Reviews

The Amazon Reviews dataset is used to predict the usefulness of Amazon reviews using off-the-shelf argumentation mining.
- Dataset
- JSON
news20

The news20 dataset is a multiclass text classification dataset.
- Dataset
- JSON
sector

The sector dataset is a multiclass text classification dataset.
- Dataset
- JSON
rcv1

The rcv1 dataset is a multiclass text classification dataset.
- Dataset
- JSON
WebKB

The dataset used in this paper is a probabilistic logic programming dataset, which is a probabilistic version of the WebKB dataset.
- Dataset
- JSON
Reuters-8

The Reuters-8 dataset is a collection of news articles from Reuters.
- Dataset
- JSON
20Newsgrp

The 20Newsgrp dataset is a collection of news articles from 20 different newsgroups.
- Dataset
- JSON
iPosts dataset

The independently posted tweets dataset (henceforth: iPosts) that we used for contradiction detection between independently emerging claim-initiating tweets.
- Dataset
- JSON
Threads RTE dataset

The dataset on which the authors run disagreement reply detection (henceforth: Threads) was converted by us to RTE format based on the threaded conversations labeled in this...
- Dataset
- JSON
Wikipedia Neutrality Corpus

This dataset is used to test the ability of large language models to detect and correct biased Wikipedia edits according to Wikipedia's Neutral Point of View (NPOV) policy.
- Dataset
- JSON
Yelp reviews polarity dataset

Yelp reviews polarity dataset
- Dataset
- JSON
News

The News dataset consists of 5000 randomly sampled news articles from the NY Times corpus. It simulates the opinions of media consumers on news items. The units are different...
- Dataset
- JSON

182 datasets found