SST2, IMDB, Rotten Tomatoes

The SST2 dataset has 6920/872/1821 example sentences in the train/dev/test sets. The task is binary classification into positive/negative sentiment. The IMDB dataset has 25000/25000 example reviews in the train/test sets with similar binary labels for positive and negative sentiment. Similarly, the Rotten Tomatoes dataset has 5331 positive and 5331 negative review sentences.

Data and Resources

Cite this as

Soumya Sanyal, Xiang Ren (2024). Dataset: SST2, IMDB, Rotten Tomatoes. https://doi.org/10.57702/tk9gj8bo

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2108.13654
Author Soumya Sanyal
More Authors
Xiang Ren
Homepage https://huggingface.co/datasets/SST2