Toxic-DPO Dataset

The dataset used in the paper is Toxic-DPO, a preference dataset of prompts paired with chosen and rejected responses, used for preference-based alignment methods such as reinforcement learning from human feedback (RLHF) and Direct Preference Optimization (DPO).
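
As a minimal sketch, the dataset can be loaded from the homepage listed below using the Hugging Face datasets library. The column names mentioned in the comments ("prompt", "chosen", "rejected") follow the usual DPO preference-pair layout and are an assumption to be verified against the dataset card.

    from datasets import load_dataset

    # Load the v0.2 release referenced on the dataset homepage.
    ds = load_dataset("unalignment/toxic-dpo-v0.2", split="train")

    # Inspect the actual schema before relying on specific fields.
    print(ds.column_names)

    # Assumed DPO-style fields: a prompt plus a preferred ("chosen") and a
    # dispreferred ("rejected") response.
    # print(ds[0]["prompt"], ds[0]["chosen"], ds[0]["rejected"])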

Data and Resources

Cite this as

Unalignment (2024). Dataset: Toxic-DPO Dataset. https://doi.org/10.57702/eflhjtjl

DOI retrieved: December 2, 2024

Additional Info

Field         Value
Created       December 2, 2024
Last update   December 2, 2024
Author        Unalignment
Homepage      https://huggingface.co/datasets/unalignment/toxic-dpo-v0.2