-
AP News Corpus
The AP News corpus contains professionally edited news articles, and its vocabulary plateaus much faster than that of the Amazon corpus.
-
AG News Dataset
The AG News dataset contains news articles from more than 2,000 news sources, annotated by type of news: Sports, World, Business, and Science/Tech. It provides 120k training and 7.6k test examples.
-
CNN/DailyMail and XSum
The CNN/DailyMail dataset is a collection of news articles paired with multi-sentence highlight summaries, and the XSum dataset is a collection of BBC news articles, each paired with a one-sentence summary.
-
AG's News Corpus
-
Reuters Dataset
The Reuters dataset (Reuters-21578) is a text classification dataset containing 21,578 newswire documents.
-
CNN/DailyMail
A bus driver who was seriously injured when he was hit by a steam engine is making good progress, his wife has said.
-
20NewsGroups
The dataset used in this paper is a collection of documents from various domains, including news, articles, and emails.