- Microsoft Academic Graph
The Microsoft Academic Graph (MAG) dataset is used to construct Maple, a multi-field benchmark for evaluating scientific literature tagging.
- Reuters21578
The problem of similarity search is to find the most similar items in a large collection to a query item of interest. Fast similarity search is at the core of many information...
- Yahoo Answer and Yelp15 review
Two large-scale document classification datasets, Yahoo Answer and Yelp15 review, representing topic classification and sentiment classification, respectively.
- Amazon
The dataset used in the paper is a series of datasets introduced in [46], comprising large corpora of product reviews crawled from Amazon.com. Top-level product categories on...