- Authority and Alignment in Wikipedia Discussions (AAWD)
A newly created corpus of Wikipedia Talk pages for dispute detection.
- Classification of Research Citations (CRC)
A dataset of 150 computer science research papers, manually annotated and class-labelled for sentiment analysis.
- Stream TwitterSentiment
Stream TwitterSentiment is a dataset of tweets for sentiment analysis, used to test the performance of active stream learning algorithms for polarity learning.
- Hatespeech
The Hatespeech dataset is a collection of tweets containing terms from hate speech lexicons.
- Targeted Sentiment Analysis
Targeted sentiment analysis for Norwegian text
- Sentence-BERT
Sentence-BERT: Sentence embeddings using Siamese BERT-networks
- Amharic Sentiment Analysis
Exploring Amharic sentiment analysis from social media texts: Building annotation tools and classification models
- NaijaSenti
NaijaSenti: A Nigerian Twitter sentiment corpus for multilingual sentiment analysis
- AfriSenti-SemEval-2023 Task 12
AfriSenti-SemEval-2023 Task 12: Multilingual fine-tuning for sentiment classification in low-resource languages
- IMDB and Yelp datasets
IMDB and Yelp are datasets used for sentiment analysis and author identification.
- Entity-Specific Sentiment Classification of Yahoo News Comments
The dataset is used for entity-specific sentiment classification of Yahoo News comments.
- Sentiment-Driven Stochastic Volatility Model
The dataset contains high-frequency news sentiment and volatility of the S&P 500.
- Tweet Sentiment Extraction
The Tweet Sentiment Extraction dataset contains positive, negative, and neutral tweets with human-annotated rationales.
- Movie Reviews
The Movie Reviews dataset contains positive and negative movie reviews with human-annotated rationales that support classification.
- Yelp Sentiment Dataset
The Yelp sentiment dataset contains Yelp reviews labeled with sentiment.
- IMDb Reviews
The dataset consists of 25,000 reviews from IMDb.