-
Million User Dataset
The Million User Dataset (MUD) consists of all posts by authors who published at least 100 and at most 1000 posts between July 2015 and June 2016. -
Dataset II: Multilingual Forums
The dataset includes discussions from six popular subreddits (in English) and also discussions in French and German, demonstrating the utility of our approach to multilingual... -
Reddit Sarcoidosis Forum Dataset
The dataset analyzed in this study comprises threads and comments from the sarcoidosis forum on the social media platform Reddit. -
Reddit Comments and Submissions
Reddit comments and submissions -
Reddit Politics and Fitness Communities
Reddit politics and fitness communities -
Reddit Million User Dataset
The Reddit Million User Dataset is a collection of 4 million comments from 400k different Reddit users. -
Reddit News Topical Interactions
The dataset used in this study has been gathered from the Pushshift Reddit repository, containing archives of the entirety of Reddit posts and comments up to June 2021. -
Reddit data
Reddit social media data from 128 universities and colleges in the U.S. collected from 2019 to 2022 -
COVID-19 sentiment analysis using college subreddit data
Reddit social media data from 128 universities and colleges in the U.S. collected from 2019 to 2022