-
Aggrefact-Unified dataset
The Aggrefact-Unified dataset is a collection of news documents and summaries with factual errors. -
Rotten Tomatoes
The Rotten Tomatoes dataset has 5331 positive and 5331 negative review sentences. -
Sentence Reduction for Automatic Text Summarization
The dataset used in this paper for sentence reduction task. -
TL;DR: Mining reddit to learn automatic summarization
The authors used the TL;DR dataset, which consists of reddit posts with summaries. -
Document Summarization Dataset
The dataset used in the paper is a document summarization dataset. The goal is to extract sentences (with character budget B) to maximize coverage of human-annotated summaries. -
Text Summarization
The dataset used for the text summarization task, where a summarizer produces an utterance made up of one or multiple sentences to succinctly report the main content of a text. -
Multi-News
The dataset used in the paper is a collection of 45K news articles and corresponding summaries, where each summary is professionally crafted and provides links to the original... -
XSUM Dataset
The XSUM dataset comprises 226,711 British Broadcasting Corporation (BBC) articles paired with their single-sentence summaries. -
English Gigaword
Text summarization aims to extract essential information from a piece of text and transform the text into a concise version. -
Smart Reply and Ambient Clinical Intelligence
The dataset used for Smart Reply and Ambient Clinical Intelligence tasks