-
Social Media Corpus for Detecting Depression
A social media corpus for detecting depression -
PIPA (People In Photo Albums)
A large-scale dataset of social media photos crawled from Flickr, used for the person recognition task in a social media setting -
Buzz in Social Media
The Buzz in Social Media dataset contains information about social media buzz. -
Reddit Comments and Submissions
Reddit comments and submissions -
Reddit Politics and Fitness Communities
Reddit politics and fitness communities -
MemeTracker Dataset
The MemeTracker dataset contains timestamped information flows captured by hyperlinks between different sites. -
Higgs Twitter Dataset
The Higgs dataset is a public dataset built by monitoring the spreading processes on Twitter before, during and after the announcement of the discovery of a new particle with... -
Breitfeller et al. (2019)
The dataset contains microaggressions in the form of social media posts. -
Tweet dataset
The dataset used in this paper is a collection of short texts, including tweets, Pascal Flickr captions, and search snippets. -
Twitter User Influence Score Dataset
The dataset contains 50,000 Twitter users with 19 features, including influence score, tweet credibility, sentiment score, and h-index score. -
Twitter Data
The dataset used in this study is a collection of Twitter data, containing all relevant tweets published for each stock. -
CLPsych 2015 Shared Task: Depression and PTSD on Twitter
A dataset created by Coppersmith et al. for the Computational Linguistics and Clinical Psychology (CLPsych) 2015 Shared Task. -
Monitoring CIFs During Disasters Using LLMs
This dataset is used in the paper for monitoring Critical Infrastructure Facilities (CIFs) during disasters using Large Language Models (LLMs). -
Stanceosaurus: A New Corpus for Multicultural Misinformation Classification
Stanceosaurus is a new corpus of 28,033 tweets in English, Hindi, and Arabic annotated with stance towards 251 misinformation claims. -
Twitter Dataset
The Twitter Dataset is a collection of tweets annotated with Plutchik's emotions, consisting of tweets in three different languages: English, Dutch, and German. -
Twitter Data Dataset
The dataset used in this paper is a collection of Twitter data, including tweets, retweets, and replies.