-
DravidianCodeMix
The dataset is a collection of comments with homophobia and transphobia annotations, used for the task of homophobia and transphobia detection. -
Multilingual Offensive Language Identiļ¬cation Dataset (OLID)
The dataset is a multilingual offensive language identification dataset for social media, containing posts from Arabic, Danish, English, Greek, and Turkish.