Social Media - Groups

Knowledge Network and Social Media Based Reputation Management

The dataset used in this research is a collection of employee knowledge network and personal reputation on social media.

Dataset
JSON

Twitter datasets

The dataset used in this paper for controversy detection in social media.

Dataset
JSON

Finding function in form: Compositional character models for open vocabulary ...

A character-level encoder for social media posts trained using supervision from associated hashtags.

Dataset
JSON

Tweet2Vec: Character-Based Distributed Representations for Social Media

Text from social media provides a set of challenges that can cause traditional NLP approaches to fail. Informal language, spelling errors, abbreviations, and special characters...

Dataset
JSON

SemEval

The dataset used for stance detection on social media, incorporating moral foundations.

Dataset
JSON

Online Media Monitor (OMM) dataset

The Online Media Monitor (OMM) from the University of Hamburg contributed with a dataset of 5,236,660 unlabeled tweets gathered from June 21, 2022, to December 8, 2022.

Dataset
JSON

Million User Dataset

The Million User Dataset (MUD) consists of all posts by authors who published at least 100 and at most 1000 posts between July 2015 and June 2016.

Dataset
JSON

Parler

The dataset is used to study the political biases of entities and hashtags on Twitter. It contains tweets from politicians, news outlets, and other verified Twitter accounts.

Dataset
JSON

TIMME

The dataset is used to study the political biases of entities and hashtags on Twitter. It contains tweets from politicians, news outlets, and other verified Twitter accounts.

Dataset
JSON

Twitter Ideology-detection via Multi-task Multi-relational Embedding

The dataset is used to study political biases of entities and hashtags on Twitter. It contains tweets from politicians, news outlets, and other verified Twitter accounts.

Dataset
JSON

Characterizing Diabetes, Diet, Exercise, and Obesity on Twitter

The dataset contains 4.5 million tweets related to diabetes, diet, exercise, and obesity.

Dataset
JSON

Dataset II: Multilingual Forums

The dataset includes discussions from six popular subreddits (in English) and also discussions in French and German, demonstrating the utility of our approach to multilingual...

Dataset
JSON

Slashdot

Slashdot is a technology news platform where users can create friend (positive) and foe (negative) links to other users.

Dataset
JSON

English Tweets Dataset

The dataset for English Tweets, used as the source domain for Domain Adaptation.

Dataset
JSON

CCTT14 Dataset

The CCTT14 dataset is a collection of 994 labeled texts, where each text is annotated with one of 14 categories.

Dataset
JSON

CCTI14 Dataset

The CCTI14 dataset is a collection of 18,966 labeled images, where each image is annotated with one of 14 categories.

Dataset
JSON

WeiboScope Dataset

The WeiboScope dataset tracks about 120,000 users from three samples: high-viral potential users, censored users, and random users. The dataset includes 64,022 censored posts...

Dataset
JSON

Twitter and YouTube Interactions Dataset

The dataset contains 14,133 users with 12,148,994 tweets and 254,659 YouTube video interactions.

Dataset
JSON

CovidMis20

The CovidMis20 dataset contains around 1,375,592 tweets from February to July 2020, which can be used to develop automatic fake news detection models.

Dataset
JSON

SoMoSiMu-Bench: A Benchmark for Social Movement Simulation

A Twitter-like environment and a benchmark SoMoSiMu-Bench for simulation and evaluation of social media user simulation.

Dataset
JSON

101 datasets found