-
Movie Review (MR) and Product Review (PR) datasets
Movie Review (MR) dataset is a binary sentiment classification dataset with movie reviews from IMDB, consisting of 1000 positive and 1000 negative movie reviews. Product Review... -
CNN news articles dataset
The CNN news articles dataset is a collection of news articles crawled from the CNN website. -
SST-2 and IMDb
Stanford Sentiment Treebank Binary (SST-2) and Internet Movie Database (IMDb) datasets for sentiment classification. -
Didi Ride-Sharing Comment Dataset
The benchmark ride-sharing comment user experience data set was constructed from the real comments in the main city zone of ride-sharing orders within the time period from Mar... -
AmazonTitles-670K
The dataset used in the LightDXML paper for extreme multi-label classification. -
WikiSeeAlsoTitles-350K
The dataset used in the LightDXML paper for extreme multi-label classification. -
Wiki10-31K
The dataset used in the LightDXML paper for extreme multi-label classification. -
Clickbait Challenge 2017
The Clickbait Challenge 2017 dataset, a collection of social media posts and their corresponding article titles, used for clickbait detection. -
Fake News Challenge Stage 1 (FNC-1)
The FNC-1 dataset is a supervised classification task for stance detection, where the goal is to automatically predict the labels in a supervised classification task. -
Semeval-2016 Task 6: Detecting stance in tweets
Semeval-2016 Task 6: Detecting stance in tweets. -
Rotten Tomatoes
The Rotten Tomatoes dataset has 5331 positive and 5331 negative review sentences. -
SST2, IMDB, Rotten Tomatoes
The SST2 dataset has 6920/872/1821 example sentences in the train/dev/test sets. The task is binary classification into positive/negative sentiment. The IMDB dataset has... -
HONEST Race
The dataset used for toxicity and stereotype mitigation task, which consists of 25 thousand examples of positive and negative movie reviews. -
Sentiment Analysis Dataset
The dataset used in the paper is a collection of unstructured text data from social networks, news sites, and forums.