Social Media - Groups

Monitoring CIFs During Disasters Using LLMs

A dataset for monitoring Critical Infrastructure Facilities (CIFs) during disasters using Large Language Models (LLMs).

Dataset
JSON

Stanceosaurus: A New Corpus for Multicultural Misinformation Classification

Stanceosaurus is a new corpus of 28,033 tweets in English, Hindi, and Arabic annotated with stance towards 251 misinformation claims.

Dataset
JSON

Twitter Dataset

The Twitter Dataset is a collection of tweets in three languages (English, Dutch, and German), annotated with Plutchik's emotions.

Dataset
JSON

Twitter Data Dataset

The dataset used in this paper is a collection of Twitter data, including tweets, retweets, and replies.

Dataset
JSON

Twitter Social Media Dataset

The dataset used in this paper is a collection of social media data from Twitter, including user profiles, follow links, and tweets.

Dataset
JSON

ShareChat Video Posts Dataset

Dataset of video posts created in the Hindi language over a period of one week on the ShareChat application, capturing both implicit signals (such as video play and skip) and...

Dataset
JSON

VAVD and UCNet

A new dataset for research on fake videos, and a deep learning based approach to identify fake videos with high accuracy using user comments.

Dataset
JSON

Twitter dataset for technological futures

The dataset was collected using the scraping library snscrape (JustAnotherArchivist, 2023). Tweets were sourced from about 400 technology influencers’ feeds published in the...

Dataset
JSON

Epinions

The Epinions dataset is a large-scale opinion mining dataset. It contains 1 million user-item interactions and is widely used for evaluating the performance of recommender systems.

Dataset
JSON

Media Architecture Tweets

The dataset is a collection of tweets from architects, designers, engineers, community managers and policy makers interested in media architecture.

Dataset
JSON

Twitter, PHEME, and Weibo datasets

Three real-world social media datasets: Twitter, PHEME, and Weibo.

Dataset
JSON

Berita Dataset

The Berita dataset consists of 50,304 digital Indonesian news articles shared online through Twitter.

Dataset
JSON

Facebook Social Circles

The Facebook Social Circles dataset contains information about the social connections between users on Facebook.

Dataset
JSON

Emoji Diffusion on Twitter

The dataset contains English tweets from May 2018 to May 2022, used to analyze the diffusion of new emojis on Twitter.

Dataset
JSON

Reddit Million User Dataset

The Reddit Million User Dataset is a collection of 4 million comments from 400k different Reddit users.

Dataset
JSON

iPosts dataset

The independently posted tweets dataset (henceforth: iPosts), used for contradiction detection between independently emerging claim-initiating tweets.

Dataset
JSON

Threads RTE dataset

The dataset on which the authors run disagreement reply detection (henceforth: Threads) was converted by us to RTE format based on the threaded conversations labeled in this...

Dataset
JSON

Twitter

Dialogue systems – often referred to as conversational agents, chatbots, etc. – provide convenient human-machine interfaces and have become increasingly prevalent with the...

Dataset
JSON

Popularity Prediction of Social Media Posts

A dataset for popularity prediction of social media posts.

Dataset
JSON

Social Media Popularity Prediction

A dataset for popularity prediction of social media posts.

Dataset
JSON

99 datasets found