Twitter Multimodal Sarcasm Detection Dataset

doi:10.57702/e3wq2zfa

The Twitter multimodal sarcasm detection dataset consists of 24k samples, each comprising a tweet, its image, and the image's attributes. The dataset is divided into training, validation, and test sets in an 80%:10%:10% ratio. The text is preprocessed with the NLTK toolkit to separate words, emoticons, and hashtags.
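
The 80%:10%:10% division can be illustrated with a minimal sketch. This is not the authors' split procedure: the function name `split_indices`, the fixed seed, and the use of a random shuffle are all assumptions made for illustration, and the sample count of 24,000 is a rounded stand-in for the "24k" stated above.

```python
import random


def split_indices(n_samples, seed=0, percents=(80, 10, 10)):
    """Shuffle sample indices and cut them into train/validation/test parts.

    Hypothetical helper: the dataset's actual split assignment is not
    described on this page, so a seeded random shuffle is assumed here.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)

    # Integer arithmetic avoids floating-point rounding at the boundaries.
    n_train = n_samples * percents[0] // 100
    n_val = n_samples * percents[1] // 100

    train = indices[:n_train]
    val = indices[n_train:n_train + n_val]
    test = indices[n_train + n_val:]  # the remainder goes to the test set
    return train, val, test


# 24000 is an assumed round figure for the "24k" samples.
train_idx, val_idx, test_idx = split_indices(24000)
print(len(train_idx), len(val_idx), len(test_idx))
```

Fixing the seed keeps the three subsets reproducible across runs, which matters when results are compared against a published split.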

Data and Resources

Original Metadata (JSON)
The JSON representation of the dataset and its distributions, based on DCAT.

Cite this as

Sundesh Gupta, Aditya Shah, Miten Shah, Laribok Syiemlieh, Chandresh Maurya (2024). Dataset: Twitter Multimodal Sarcasm Detection Dataset. https://doi.org/10.57702/e3wq2zfa

DOI retrieved: December 2, 2024

Additional Info

Field	Value
Created	December 2, 2024
Last update	December 2, 2024
Author	Sundesh Gupta
More Authors	Aditya Shah, Miten Shah, Laribok Syiemlieh, Chandresh Maurya
Homepage	https://arxiv.org/abs/1907.11692