-
Metropolitan data
The Metropolitan data comes from a study of emotions expressed through Twitter messages posted from locations around Los Angeles County. -
Big data and big values: When companies need to rethink themselves
The dataset contains more than 94,000 tweets related to the core values of the firms listed in Fortune’s ranking of the World’s Most Admired Companies (2013-2017). -
Personality Traits and Echo Chambers on Facebook
The dataset contains 30K users who made more than 3M comments in a time span of 5 years (Jan 2010 — Dec 2014) on 413 US public Facebook pages supporting conflicting narratives —... -
GeoUK 2022 Tweets Dataset
A dataset of geolocated tweets in 2022, filtered to keep only tweets in the UK. -
Twitter Airline Data
Twitter Airline Data dataset contains sentiment values on a scale of 0-2. -
Blogger Dataset
The dataset used in this study is a large, industry-annotated dataset that contains over 20,000 blog users. -
Twitter OOV Word Dataset
The dataset is a collection of Twitter tweets, filtered to include only English language tweets. The dataset is used to study out-of-vocabulary (OOV) words in Twitter. -
Reputable News Index (RNIX)
The Reputable News Index (RNIX) dataset consists of retweet cascades linking articles from 28 reputable news publishers. -
Conroversial News Index (CNIX)
The Conroversial News Index (CNIX) dataset consists of retweet cascades mentioning articles from 41 online news publishers known for controversial content. -
Anonymous Twitter Dataset
The dataset used in this paper is a collection of tweets from Anonymous accounts and a random sample of non-Anonymous Twitter users. -
Media Frames Corpus
A dataset of annotated news articles and social media posts for frame classification. -
Tweet Judgement Classification of Rumours
The dataset used in the paper for tweet-level judgement classification of rumours in social media. -
Weibo Corpus
A dataset containing unstructured dialogues extracted from Weibo. -
Twitter Corpus
A dataset containing unstructured dialogues extracted from Twitter. -
Twigraph: Discovering and Visualizing Influential Words between Twitter Profiles
The dataset used in the paper is a collection of 1.1M tweets from Twitter, with approximately 3000 tweets per user from various domains such as politics, sports, entertainment,... -
Sina Weibo dataset
Sina Weibo dataset contains 226.8 million Weibo posts collected over the full course of 2012. -
TuDiabetes Forum
TuDiabetes Forum: We also collected a dataset from the TuDiabetes forum, a popular diabetes community operated by the Diabetes Hands Foundation. -
BGnow, TuDiabetes Forum
BGnow dataset is derived from diabetic users who actively share their wellness data on Twitter. TuDiabetes Forum: We also collected a dataset from the TuDiabetes forum, a... -
Diabetes Support Group, BGnow, TuDiabetes Forum
Diabetes Support Group dataset is collected from posts of users who follow and participate in diabetes support groups like “diabeteslife” or “diabetesconnect” on Twitter. BGnow...