- Tweet dataset
  The dataset used in this paper is a collection of short texts, including tweets, Pascal Flickr captions, and search snippets.
- New York Times and 20Newsgroups datasets
  The datasets used in the paper are the New York Times dataset and the 20Newsgroups dataset.
- 20Newsgroups dataset
  The 20Newsgroups dataset contains 18,846 newsgroup documents.
- Japanese Election Manifesto Data
  The Japanese Election Manifesto Data contains the texts of Japanese election manifestos.
- Congressional Bills Project
  The Congressional Bills Project dataset contains the texts of congressional bills.