-
CUED_SPEECH at TREC 2020 podcast summarization track
Large English document corpus containing 100,000 podcasts -
Spotify dataset for podcast summarization
Podcast summarization dataset containing over 100,000 English podcasts -
Filtered Spotify Podcast Dataset
The dataset after filtering consists of 90,055 episodes. -
Spotify Podcast Dataset
The Spotify Podcast Dataset consists of 105,360 episodes with transcripts and creator descriptions, and is provided as a training dataset for the summarization task.