Hateful Memes Dataset

The Hateful Memes Dataset consists of a training set of 8500 images, a dev set of 500 images & a test set of 1000 images. The meme text is present on the images, but also provided in additional jsonl files. To increase its difficulty, the dataset includes text- & vision confounders.

Data and Resources

Cite this as

Niklas Muennighoff (2024). Dataset: Hateful Memes Dataset. https://doi.org/10.57702/6v24xbxx

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2012.07788
Author Niklas Muennighoff
Homepage https://github.com/Muennighoff/vilio