-
PARLAI SINGLE ADVERSARIALbenchmark
The PARLAI SINGLE ADVERSARIALbenchmark dataset consists of single-turn conversations annotated based on offensiveness. -
PARLAI SINGLE STANDARDBenchmark
The PARLAI SINGLE STANDARDBenchmark dataset consists of single-turn conversations annotated based on offensiveness. -
From Detection of Toxic Spans in Online Discussions to Analysis of Toxic-to-C...
The ToxicSpans dataset is a subset of the Civil Comments dataset, containing toxic spans. -
Jigsaw Dataset
The Jigsaw dataset is a collection of text, where each text is labeled as toxic or non-toxic. -
RealToxicityPrompts
RealToxicityPrompts constitutes a collection of 100k naturally occurring sentences, amassed from various internet sources and designed to function as LM prompts.