-
Monitoring CIFs During Disasters Using LLMs
This dataset is used in the paper to monitor Critical Infrastructure Facilities (CIFs) during disasters with Large Language Models (LLMs). -
Stanceosaurus: A New Corpus for Multicultural Misinformation Classification
Stanceosaurus is a new corpus of 28,033 tweets in English, Hindi, and Arabic annotated with stance towards 251 misinformation claims. -
Twitter Dataset
The Twitter Dataset is a collection of tweets annotated with Plutchik's emotions, consisting of tweets in three different languages: English, Dutch, and German. -
Twitter Data Dataset
The dataset used in this paper is a collection of Twitter data, including tweets, retweets, and replies. -
Twitter Social Media Dataset
The dataset used in this paper is a collection of social media data from Twitter, including user profiles, follow links, and tweets. -
ShareChat Video Posts Dataset
A dataset of Hindi-language video posts created over a period of one week on the ShareChat application, capturing both implicit signals (such as video play and skip) and... -
VAVD and UCNet
A new dataset for research on fake videos, and a deep learning based approach to identify fake videos with high accuracy using user comments. -
Twitter dataset for technological futures
The dataset was collected using the scraping library snscrape (JustAnotherArchivist, 2023). Tweets were sourced from about 400 technology influencers’ feeds published in the... -
Media Architecture Tweets
The dataset is a collection of tweets from architects, designers, engineers, community managers and policy makers interested in media architecture. -
Twitter, PHEME, and Weibo datasets
Twitter, PHEME, and Weibo are three real-world social media datasets. -
Berita Dataset
The Berita dataset consists of 50,304 digital Indonesian news articles shared online through Twitter. -
Facebook Social Circles
The Facebook Social Circles dataset contains information about the social connections between users on Facebook. -
Emoji Diffusion on Twitter
The dataset contains English tweets from May 2018 to May 2022, used to analyze the diffusion of new emojis on Twitter. -
Reddit Million User Dataset
The Reddit Million User Dataset is a collection of 4 million comments from 400k different Reddit users. -
iPosts dataset
The independently posted tweets dataset (henceforth: iPosts), used for contradiction detection between independently emerging claim-initiating tweets. -
Threads RTE dataset
The dataset on which the authors run disagreement reply detection (henceforth: Threads) was converted by us to RTE format based on the threaded conversations labeled in this... -
Popularity Prediction of Social Media Posts
A dataset for popularity prediction of social media posts. -
Social Media Popularity Prediction
A dataset for popularity prediction of social media posts.