-
Penn Tree Bank
The Penn Tree Bank dataset is a corpus split into a training set of 929k words, a validation set of 73k words, and a test set of 82k words. The... -
RP dataset
The RP dataset, derived from the RELPRON dataset, consists of 105 noun phrases containing relative clauses. -
MC (Meaning Classification) dataset
The MC (Meaning Classification) dataset is a specially crafted dataset used for a classification task. -
Multi-Scale Feature Fusion Quantum Depthwise Convolutional Neural Networks fo...
Text classification is an important and widely studied task in natural language processing (NLP), with extensive applications such as sentiment analysis, topic classification,... -
AG's News Corpus
AG's News Corpus -
BanFakeNews
The BanFakeNews dataset is a publicly available annotated dataset consisting of approximately 50K Bangla news articles, with 97.4% of the majority class and 2.6% of the minority... -
IMDB Document
The dataset used in the paper is a collection of text sequences for text classification tasks. -
Yelp 2014 Document
The dataset used in the paper is a collection of text sequences for text classification tasks. -
Yelp 2013 Document
The dataset used in the paper is a collection of text sequences for text classification tasks. -
Yelp Review Dataset
The Yelp review dataset contains hotel and restaurant reviews that Yelp has either filtered (spam) or recommended (legitimate). -
20NG Dataset
The 20NG dataset is a text classification dataset containing 20 categories. -
Ohsumed Dataset
The Ohsumed dataset is a text classification dataset containing 3,357 documents. -
Reuters Dataset
The Reuters dataset is a text classification dataset containing 21,578 samples. -
Text Classification Dataset
A dataset used for text classification with a variant of the typical text classification model based on convolutional operations and a max-pooling layer. -
Shakespeare dataset
Mobile crowdsensing has gained significant attention in recent years and has become a critical paradigm for emerging Internet of Things applications. The sensing devices... -
Multilingual Text Classification Dataset
A multilingual text classification dataset covering 17 different languages.