-
Reuters-21578
Text classification has long been an active research field; its aim is to develop algorithms that assign categories to given documents. -
Amazon Review
The Amazon Review dataset is a widely used benchmark dataset for cross-domain sentiment analysis. -
Text Classification based on Multiple Block Convolutional Highways
Text classification based on Multiple Block Convolutional Highways -
Yelp Dataset Challenge
The Yelp dataset challenge contains reviews and images of restaurants, with the goal of recommending images for each review. -
Amazon@Beauty and Amazon@Books datasets
The Amazon@Beauty and Amazon@Books datasets are collections of product reviews from Amazon.com. -
OpenWebText Corpus
A dataset for language modeling, where the goal is to predict the next word in a sequence given the previous words. -
The pushshift reddit dataset
The pushshift reddit dataset -
Conditional Generative Matching Model for Multi-lingual Reply Suggestion
A Conditional Generative Matching Model for Multi-lingual Reply Suggestion -
IMDB dataset
The IMDB dataset is a polarity dataset for sentiment analysis or text classification. It contains 50,000 movie reviews and their binary class labels, being either "Positive" or... -
Disin dataset
The Disin dataset is a fake news dataset on Kaggle, including 12,600 fake news articles and 12,600 truthful news articles. -
COVID-19 Research Articles Classification
A text classification dataset built to support Epistemonikos' effort to filter and categorize research articles related to COVID-19. -
AGNews Dataset
The AGNews dataset is a collection of news articles, where each article is labeled with one of four topics (World, Sports, Business, or Sci/Tech). -
Amazon
The paper uses a series of datasets introduced in [46], comprising large corpora of product reviews crawled from Amazon.com. Top-level product categories on...