You're currently viewing an old version of this dataset. To see the current version, click here.

Twitter Multimodal Sarcasm Detection Dataset

The Twitter multimodal dataset consists of 24k samples of the tweet, image, and image attributes. The dataset is divided into the training set, validation set, and test set in the ratio 80%:10%:10%. The dataset is preprocessed to separate words, emoticons, and hashtags with the NLTK toolkit.

Data and Resources

Cite this as

Sundesh Gupta, Aditya Shah, Miten Shah, Laribok Syiemlieh, Chandresh Maurya (2024). Dataset: Twitter Multimodal Sarcasm Detection Dataset. https://doi.org/10.57702/e3wq2zfa

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Author Sundesh Gupta
More Authors
Aditya Shah
Miten Shah
Laribok Syiemlieh
Chandresh Maurya
Homepage https://arxiv.org/abs/1907.11692