Toxicity Detection - Groups

PARLAI SINGLE ADVERSARIALbenchmark

The PARLAI SINGLE ADVERSARIALbenchmark dataset consists of single-turn conversations annotated based on offensiveness.

Dataset
JSON

PARLAI SINGLE STANDARDBenchmark

The PARLAI SINGLE STANDARDBenchmark dataset consists of single-turn conversations annotated based on offensiveness.

Dataset
JSON

PARLAI

The PARLAI dataset consists of single-turn conversations annotated based on offensiveness.

Dataset
JSON

From Detection of Toxic Spans in Online Discussions to Analysis of Toxic-to-C...

The ToxicSpans dataset is a subset of the Civil Comments dataset, containing toxic spans.

Dataset
JSON

Jigsaw Dataset

The Jigsaw dataset is a collection of text, where each text is labeled as toxic or non-toxic.

Dataset
JSON

RealToxicityPrompts

RealToxicityPrompts constitutes a collection of 100k naturally occurring sentences, amassed from various internet sources and designed to function as LM prompts.

Dataset
JSON