- RumourEval 2019: Determining Rumour Veracity and Support for Rumours
  The dataset was collected from tree-shaped Twitter and Reddit discussions. The stance labels were obtained via crowdsourcing.
- Reddit Sarcoidosis Forum Dataset
  The dataset analyzed in this study comprises threads and comments from the sarcoidosis forum on the social media platform Reddit.
- Reddit Comments and Submissions
  Reddit comments and submissions.
- Reddit Politics and Fitness Communities
  Reddit politics and fitness communities.
- Clean Corpus
  The clean corpus contains a web scrape of 1.2 million Reddit threads from 1,697 top subreddits.
- Reddit Conversation dataset
  Reddit Conversation dataset.
- Reddit Million User Dataset
  The Reddit Million User Dataset is a collection of 4 million comments from 400k different Reddit users.
- Reddit News Topical Interactions
  The dataset used in this study was gathered from the Pushshift Reddit repository, which archives all Reddit posts and comments up to June 2021.
- one-million-reddit-questions
  The dataset contains 500 questions from one million open-ended requests posted on AskReddit; 129,483 of the one million requests were identified as asking for help.
- Reddit data
  Reddit social media data from 128 universities and colleges in the U.S., collected from 2019 to 2022.
- COVID-19 sentiment analysis using college subreddit data
  Reddit social media data from 128 universities and colleges in the U.S., collected from 2019 to 2022.