Negative preference optimization: From catastrophic collapse to effective unlearning

doi:doi:10.57702/vjgwd5mx

Negative preference optimization: From catastrophic collapse to effective unlearning

A dataset for testing the effectiveness of unlearning methods in large language models.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Ruiqi Zhang, Licong Lin, Yu Bai, Song Mei (2025). Dataset: Negative preference optimization: From catastrophic collapse to effective unlearning. https://doi.org/10.57702/vjgwd5mx

DOI retrieved: January 3, 2025

Additional Info

Field	Value
Created	January 3, 2025
Last update	January 3, 2025
Defined In	https://doi.org/10.48550/arXiv.2407.01920
Author	Ruiqi Zhang
More Authors	Licong Lin Yu Bai Song Mei