-
Blended Skill Talk (BST) dataset
Datasets used for training and testing dialogue models -
RealToxicityPrompts
RealToxicityPrompts constitutes a collection of 100k naturally occurring sentences, amassed from various internet sources and designed to function as LM prompts.